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Preface to the Sixth Edition 


As the title suggests, this book is about fundamental things that one might 
expect to remain fairly the same. Yet astronomy has evolved enormously over 
the last few years, and only a few chapters of this book have been left unmod¬ 
ified. 

Since the book is used also by many amateurs, the introductory chapter 
has been extended to give a brief summary of different celestial objects to 
“soften” the jump to rather technical topics. 

The chapter on the solar system was very long. It has now been split into 
two separate chapters. Chapter 7 deals with general properties of the solar 
system. Individual objects are discussed in Chap. 8, which is more prone 
to change when new data will accumulate. Also, new data on exoplanets is 
obtained at an increasing rate. Therefore exoplanets are given a chapter of 
their own; it is at the end of the book, since it is closely related to astrobiology, 
already included in the previous edition. These last chapters may change more 
than the rest of the book in the future. 

These changes mean that the numbering of formulas and figures has 
changed quite extensively after the previous version of the book. 

Cosmology and galactic astronomy have still been evolving rapidly. There¬ 
fore there are many revisions to the chapters on the Milky Way, galaxies, and 
cosmology. 

In addition, several other chapters contain smaller revisions and many of 
the previous images have been replaced with newer ones. 

Helsinki, Finland Hannu Karttunen 

April 2016 Pekka Kroger 

Heikki Oja 
Markku Poutanen 
Karl Johan Donner 


v 



Preface to the First Edition 


The main purpose of this book is to serve as a university textbook for a first 
course in astronomy. However, we believe that the audience will also include 
many serious amateurs, who often find the popular texts too trivial. The lack 
of a good handbook for amateurs has become a problem lately, as more and 
more people are buying personal computers and need exact, but comprehen¬ 
sible, mathematical formalism for their programs. The reader of this book is 
assumed to have only a standard high-school knowledge of mathematics and 
physics (as they are taught in Finland); everything more advanced is usually 
derived step by step from simple basic principles. The mathematical back¬ 
ground needed includes plane trigonometry, basic differential and integral 
calculus, and (only in the chapter dealing with celestial mechanics) some vec¬ 
tor calculus. Some mathematical concepts the reader may not be familiar with 
are briefly explained in the appendices or can be understood by studying the 
numerous exercises and examples. However, most of the book can be read 
with very little knowledge of mathematics, and even if the reader skips the 
mathematically more involved sections, (s)he should get a good overview of 
the field of astronomy. 

This book has evolved in the course of many years and through the work 
of several authors and editors. The first version consisted of lecture notes by 
one of the editors (Oja). These were later modified and augmented by the 
other editors and authors. Hannu Karttunen wrote the chapters on spherical 
astronomy and celestial mechanics; Vilppu Piirola added parts to the chapter 
on observational instruments, and Goran Sandell wrote the part about radio 
astronomy; chapters on magnitudes, radiation mechanisms and temperature 
were rewritten by the editors; Markku Poutanen wrote the chapter on the so¬ 
lar system; Juhani Kyrolainen expanded the chapter on stellar spectra; Timo 
Rahunen rewrote most of the chapters on stellar structure and evolution; Ilkka 
Tuominen revised the chapter on the Sun; Kalevi Mattila wrote the chapter 
on interstellar matter; Tapio Markkanen wrote the chapters on star clusters 
and the Milky Way; Karl Johan Donner wrote the major part of the chap¬ 
ter on galaxies; Mauri Valtonen wrote parts of the galaxy chapter, and, in 
collaboration with Pekka Teerikorpi, the chapter on cosmology. Finally, the 
resulting, somewhat inhomogeneous, material was made consistent by the 
editors. 

The English text was written by the editors, who translated parts of the 
original Finnish text, and rewrote other parts, updating the text and correcting 
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errors found in the original edition. The parts of text set in smaller print are 
less important material that may still be of interest to the reader. 

For the illustrations, we received help from Veikko Sinkkonen, Mirva 
Vuori and several observatories and individuals mentioned in the figure cap¬ 
tions. In the practical work, we were assisted by Arja Kyrolainen and Merja 
Karsma. A part of the translation was read and corrected by Brian Skiff. We 
want to express our warmest thanks to all of them. 

Financial support was given by the Finnish Ministry of Education and Suo- 
malaisen kirjallisuuden edistamisvarojen valtuuskunta (a foundation promot¬ 
ing Finnish literature), to whom we express our gratitude. 

Flelsinki, Finland Flannu Karttunen 

June 1987 Pekka Kroger 

Heikki Oja 
Markku Poutanen 
Karl Johan Donner 
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Introduction 


On a dark, cloudless night, at a distant location 
far away from the city lights, the starry sky can 
be seen in all its splendour (Fig. 1.1). It is easy to 
understand how these thousands of lights in the 
sky have affected people throughout the ages. 

As long as human beings have existed, they 
have certainly wondered the sky. In the sky, 
ancient people saw figures related to religious 
myths and omens sent by the gods. However, 
already a couple of millennia ago the real as¬ 
tronomy started to evolve, separating itself from 
religions and astrological superstitions. People 
started to study the sky for its own sake. 


1.1 Celestial Objects 

In the 17th century people started to realise that 
the Earth is not the centre of the Universe. About 
the same time emerged the current view that stars 
are celestial bodies similar to our Sun. The seem 
to be faint dots only due to their huge distances. 
We now know that the Sun and stars are hot glow¬ 
ing balls of gas, producing energy when fusion 
reactions convert hydrogen to helium and also to 
other heavier elements (Chap. 11). 

Although stars actually move at enormous 
speeds, the sky does not seem to change even 
in thousands of years, due to the vast distances 
of the stars. In addition to the Sun and the Moon 
there are some other objects that move with re¬ 
spect to the stars. Since the antiquity, these mov¬ 
ing objects have been called planets, from the 
Greek word meaning a wanderer. 


The rapid motions of the planets reveal that 
they are much closer than the stars. Indeed they 
are objects orbiting the Sun. According the cur¬ 
rent definition (Chap. 7) there are eight planets 
orbiting the Sun: Mercury, Venus, Earth, Mars, 
Jupiter, Saturn, Uranus and Neptune. In addition 
to these relatively big bodies a lot of different 
smaller objects move around the Sun: dwarf plan¬ 
ets, asteroids, comets and meteoroids (Chap. 8). 
Mosts planets also have their own satellites or 
moons. Planets, moons and minor bodies do not 
produce light by nuclear fusion; instead they 
shine just by reflecting the sunlight. 

At the centre of the solar system shines the 
Sun, producing energy by fusion reactions 
(Chap. 13). It is the nearest star, and studying its 
properties reveals also a lot about other stars. 

A few thousand stars can be seen by the naked 
eye, but even a small telescope reveals millions 
of them. Based on their properties, stars can be 
divided into different categories. A great major¬ 
ity of them is main-sequence stars, like our Sun. 
Some of them, though, are much bigger, giants 
or supergiants, and some are much smaller, white 
dwarfs. Different stars are usually related to dif¬ 
ferent evolutionary stages in the lives of stars. 
Many stars are variable stars, whose brightness 
varies with time. 

Rather recently found objects are compact 
stars', neutron stars and black holes (Chap. 15). 
Their material is squeezed into such a com¬ 
pressed form and their gravitational field is so 
strong that Einstein’s general theory of relativity 
must be used to describe their matter and space 
around them. 
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1 Introduction 


Fig. 1.1 The starry sky in 
all its splendour can be 
seen only far away from 
the light pollution of cities. 
(Pekka Parviainen) 



Fig. 1.2 The Pleiades is 


one of the best-known open 


star clusters. The six 

- •* ♦ ♦ 

• ♦ 

brightest stars can easily be 


seen with the naked eye. 


Photographs reveal also 

.• " ilk ’ 

interstellar gas reflecting 


the light of the stars. 

♦ . 

(NASA, ESA, 


AURA/Caltech, Palomar 


Observatory ) 



i • p 


. • 




♦ ‘ - 


♦ ■ • 


The Sun is a solitary star. Many stars appear in 
pairs, they are binary stars, orbiting around their 
common centre of mass (Chap. 10). Also systems 
of several stars are relatively common. 

Bigger groups of stars are star clusters 
(Chap. 17). Open clusters (Fig. 1.2) usually con¬ 
tain a few tens or hundreds of stars, that were born 
in the same area, usually quite recently. Eventu¬ 
ally the stars will diverge to their own paths. 

Globular cluster (Fig. 1.3), on the other hand, 
may contain hundreds of thousands or millions of 
stars, which are usually very old. 


The interstellar space corresponds pretty well 
to our idea about a perfect vacuum. However, it 
is not totally empty but contains interstellar mat¬ 
ter, mainly hydrogen and helium, but also minute 
amounts of heavier elements, molecules and dust 
(Chap. 16). The interstellar medium does not 
fill the space as a uniform mist, but forms huge 
clouds (Fig. 1.4). 

New stars are bom by condensing from the in¬ 
terstellar matter. When the density, pressure and 
temperature of the condensing cloud have risen 
high enough, fusion reactions start and a new star 
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Fig. 1.3 The globular 
cluster Ml3 in the 
Hercules constellation 
contains over a million 
stars. The cluster can even 
be seen with the naked eye 
as a small nebulous spot. 
(Palomar Observatory) 



begins to radiate the energy released in the re¬ 
actions (Chap. 12). After millions or billions of 
years the energy resources will be exhausted. The 
evolution then depends on the mass of the star. 
The smallest stars just cool down and fade away, 
but more massive ones either eject part of their 
mass back to space as planetary nebula or ex¬ 
plode as supernovas. Thus matter converted by 
the nuclear reactions of the stars is mixed with 
the interstellar matter. 

All stars visible as separate objects to the 
naked eye or with binoculars belong to the Milky 
Way (Fig. 1.5, Chap. 18). The Milky Way is a 
system containing a couple of billion stars, a 
galaxy (Figs. 1.6 and 1.7, Chap. 19). It takes 
about 100,000 years to travel across the Milky 
Way with the speed of light. 

Milky Way is not the only galaxy, but just 
one of very many similar systems. Galaxies are 
the basic building blocks of the Universe. They 
do not spread out evenly but form small galaxy 
groups , bigger galaxy clusters and even bigger 
superclusters. 

Galaxies are observed close to the edge of the 
visible universe. Nuclei of some galaxies are seen 
as quasars', the most distant of them have radiated 
the light we detect now when the age of the Uni¬ 
verse was only one tenth of the current value. 


1.2 The Role of Astronomy 

Already a long time ago man was interested in 
celestial phenomena. Several bone carvings made 
by the Cro-Magnon men as early as 30,000 years 
ago have been found, possibly recording Lunar 
phases. In that case these calendars would be 
the oldest astronomical documents, predating the 
skill of writing by some 25,000 years. 

Agriculture required a good knowledge of 
the seasons. Religious rituals and prognostica¬ 
tion were based on the locations of the celestial 
bodies. Thus time reckoning became more and 
more accurate, and people learned to calculate 
the movements of celestial bodies in advance. 

During the rapid development of seafaring, 
when voyages extended farther and farther from 
home ports, position determination presented 
a problem for which astronomy offered a prac¬ 
tical solution. Solving these problems of navi¬ 
gation were the most important tasks of astron¬ 
omy in the 17th and 18th centuries, when the first 
precise tables on the movements of the planets 
and on other celestial phenomena were published. 
The basis for these developments was the discov¬ 
ery of the laws governing the motions of the plan¬ 
ets by Copernicus, Tycho Brahe, Kepler, Galilei 
and Newton. 
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1 Introduction 



Fig. 1.4 The North America nebula in the constellation 
of Cygnus is a large cloud of interstellar gas. The nebula 
appears brighter than the background because the radia¬ 
tion from the nearby stars makes it shine. The nebula is, 

Astronomical research has changed man’s 
view of the world from geocentric, anthropocen¬ 
tric conceptions to the modern view of a vast uni¬ 
verse where man and the Earth play an insignifi¬ 
cant role. Astronomy has taught us the real scale 
of the nature surrounding us. 

Modern astronomy is fundamental science, 
motivated mainly by man’s curiosity, his wish to 
know more about Nature and the Universe. As¬ 
tronomy has a central role in forming a scien¬ 
tific view of the world. “A scientific view of the 
world” means a model of the universe based on 
observations, thoroughly tested theories and log¬ 
ical reasoning. Observations are always the ulti¬ 
mate test of a model: if the model does not fit 
the observations, it has to be changed, and this 


however, very faint and difficult to observe visually. The 
brightest star on the right is a Cygni or Deneb. (Photo 
M. Poutanen and H. Virtanen) 


process must not be limited by any philosophical, 
political or religious conceptions or beliefs. 


1.3 Astronomical Objects of 
Research 

Modern astronomy explores the whole Universe 
and its different forms of matter and energy. As¬ 
tronomy can be divided into different branches in 
several ways, based e.g. on the object of research 
or the method used. 

The Earth (Fig. 1.10) is of interest to astron¬ 
omy for many reasons. Nearly all observations 
must be made through the atmosphere (Fig. 1 .9), 
and the phenomena of the upper atmosphere and 
magnetosphere reflect the state of interplanetary 
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Fig. 1.5 The Milky Way 
appears as a nebulous band 
stretching across the sky. 

A telescope reveals that it 
consists of myriads of stars 
as observed already by 
Galileo Galilei 400 years 
ago. The Milky Way is a 
flat disclike stellar system. 
Our solar system is close to 
the plane of the disc, and 
looking in the direction of 
the plane we see a lot of 
stars. But if we look away 
from the disc the stellar 
density is much lower. The 
disc contains also unevenly 
distributed interstellar gas 
and dust, which obscures 
the view in some 
directions. Near the lower 
edge of the picture the 
Milky Way seems to split 
into two branches because 
the distant stars are behind 
the intervening obscuring 
matter. (Pekka Parviainen) 



space. The Earth is also the most important ob¬ 
ject of comparison for planetologists. 

The Moon is still studied by astronomical 
methods, although spacecraft and astronauts have 
visited its surface and brought samples back to 
the Earth. To amateur astronomers, the Moon 
is an easy and interesting object for observa¬ 
tions. 

Space probes have already studied all plan¬ 
ets, many of their satellites, some asteroids and 


comets. The most distant planets, Uranus and 
Neptune, have been observed only by fly-bys, but 
all the other ones also by orbiters. Spacecraft have 
softlanded on Mars, Venus, Saturn’s moon Ti¬ 
tan and some minor bodies. Exploration by such 
probes has tremendously added to our knowledge 
about the conditions of these objects. Continuous 
monitoring of the planets, however, can still only 
be made from the Earth, and many small bodies 
of the solar system still await their spacecraft. 



6 


1 Introduction 


Fig. 1.6 The galaxy M3 1 
in the Andromeda 
constellation is a star 
system resembling our own 
Milky Way. Stars and 
interstellar matter 
concentrate in spiral arms. 
The shape of M31 is a 
round, flat disc, but due to 
the oblique view it looks 
oval. In good conditions 
the centre of the galaxy can 
be seen even with the 
naked eye as a faint 
nebulous spot. M31 has 
two small elliptic 
neighbour galaxies seen 
here as bright ellipses. M32 
is below the centre of M31 
and Ml 10 towards 
northeast from the centre. 
North is up. The dots are 
stars of the Milky Way. 
(Bill Schoening, Vanessa 
Harvey/REU pro- 
gram/NOAO/AURA/NSF) 



Fig.1.7 In addition to the 
big galaxies like the Milky 
Way there are numerous 
much smaller dwarf 
galaxies, which are often 
irregular in shape. One of 
them is the Large 
Magellanic Cloud, our 
nearest neighbour galaxy. 

It is easily seen with the 
naked eye, but it is close to 
the southern pole of the 
sky. (NOAO/Cerro Tololo 
Inter-American 
Observatory) 
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Fig. 1.8 The deep-field 
picture of the Hubble space 
telescope is a combination 
of several images exposed 
altogether over 11 days. 
The picture shows several 
galaxies that are the most 
distant ones known. When 
we are looking far to space 
we are also looking far to 
the past since the light 
proceeds at a finite speed. 
Thus many of the galaxies 
in the picture are also 
among the oldest known 
objects. When we compare 
them with the objects in 
our neighbourhood we can 
deduce how the galaxies 
evolved during billions of 
years. (NASA) 



Fig. 1.9 Although space 
probes and satellites have 
gathered remarkable new 
information, a great 
majority of astronomical 
observations is still 
Earth-based. The most 
important observatories are 
usually located at high 
altitudes far from densely 
populated areas. One such 
observatory is on Mt 
Paranal in Chile, which 
houses the European VLT 
telescopes. (Photo ESO) 



An astronomer can also specialise in study¬ 
ing several different fields like the Sun, differ¬ 
ent kinds of stars, star clusters, the Milky Way 
or galaxies (Fig. 1.11). 


The largest object of research is the whole 
Universe. Earlier this field, cosmology, belonged 
to theologists and philosophers, but in the 20th 
century it became an object of physical theories 
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and eventually of concrete astronomical observa¬ 
tions. 

Spherical astronomy is an old field of astron¬ 
omy studying the coordinate systems of the ce¬ 
lestial sphere and apparent positions and motions 
of the celestial objects. Until the 17th century as¬ 
tronomy was mainly spherical astronomy. 

When Isaac Newton published the fundamen¬ 
tal laws of mechanics in 1687 in his Principia 
mathematica, the motions of celestial objects got 
a physical explanation. That was the beginning 
of celestial mechanics, studying the motions from 
the planets of the Solar System and satellites or¬ 
biting the Earth to distant galaxies and galaxy 
clusters. 


Halfway the 19th century it was found out how 
spectra can reveal physical properties of celestial 
objects. This was the beginning of astrophysics, 
studying the physical phenomena of the stars. Re¬ 
sults from astrophysics are utilised particularly in 
the research of the Sun, stars and interstellar mat¬ 
ter. 

Astronomy can be divided into different bran¬ 
ches also by the wavelengths used. We can talk 
about radio, infrared, optical, ultraviolet, X-ray or 
gamma-ray astronomy, depending on the wave¬ 
length used in the observations. 

Astronomers study also particles coming from 
the space, like neutrinos and cosmic rays. Grav- 


Fig. 1.10 The Earth as 
seen from the Moon. 
Thanks to spaceflights we 
have seen clearly the 
planetary status of the 
Earth. The picture was 
taken by the Japanese 
Kaguya lunar orbiter in 
2007. Currently the Moon 
is the only celestial object 
outside the Earth visited by 
human beings, on the 
Apollo flights in 
1969-1972. (JAXA) 




Planets and solar system 
Active galactic nuclei 
Sun 

Interstellar matter, Milky Way 
Origin of stars and planets 
Observational instruments 


Fig. 1.11 Astronomy in the change. Although the num¬ 
bers of astronomical articles have increased in all sub- 
fields in the last few decades, the relative proportions have 
changed. Cosmology and galaxies are the greatest winners 
whereas the share of stellar research has decreased. The 


graph illustrates the relative numbers of articles in differ¬ 
ent fields of astronomy in the most influential journals in 
1981-2009. (Adapted from the New Worlds, New Hori¬ 
zons in Astronomy and Astrophysics, 2010, p. 120.) Pub¬ 
lished by the US National Science Academy 
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Fig. 1.12 The dimensions of the Universe 


itational waves are the most recent object of re¬ 
search. 

Astronomy and space research may seem 
to be related, although they are quite differ¬ 
ent things. Space research includes all activi¬ 


ties in the space, but only a minor fraction of 
that is astronomical research. Space research is 
mainly commercial services, like communica¬ 
tion, weather observations, navigation, remote 
sensing and environmental control, and also mil- 
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itary reconnaissance. Space astronomy is a field 
of astronomy that utilises observations made by 
satellites and space probes. 


1.4 The Scale of the Universe 

The masses and sizes of astronomical objects 
are usually enormously large. But to understand 
their properties, the smallest parts of matter, 
molecules, atoms and elementary particles, must 
be studied. The densities, temperatures and mag¬ 
netic fields in the Universe vary within much 
larger limits than can be reached in laboratories 
on the Earth (Fig. 1.12). 

The greatest natural density met on the Earth is 
22,500 kgm“ 3 (osmium), while in neutron stars 
densities of the order of 10 18 kgm~ 3 are possi¬ 
ble. The density in the best vacuum achieved on 
the Earth is only 10 -9 kgm -3 , but in interstellar 
space the density of the gas may be HP 31 kgm -3 
or even less. Modern accelerators can give parti¬ 
cles energies of the order of 10 13 electron volts 
(eV). Cosmic rays coming from the sky may have 
energies of over 10 2 ° eV. 

It has taken man a long time to grasp the vast 
dimensions of space. Already Hipparchos in the 
second century B.C. obtained a reasonably cor¬ 
rect value for the distance of the Moon. The scale 
of the solar system was established together with 
the heliocentric system in the 17th century. In 
the old geocentric system the distances of planets 
did not affect their apparent motions and could 


be chosen arbitrarily. In the heliocentric system 
this is no more possible. Thus the distances of 
the Solar System were known reasonably well as 
early as in the 15th century. Also serious attempts 
to determine stellar distances were made, but the 
first successful measurements were made only in 
the 1830’s, and decent estimates for the distances 
to the galaxies were obtained only in the 1920’s. 

We can get some kind of picture of the dis¬ 
tances involved (Fig. 1.4) by considering the time 
required for light to travel from a source to the 
retina of the human eye. It takes 8 minutes for 
light to travel from the Sun, 5 j hours from Nep¬ 
tune and 4 years from the nearest star. We can¬ 
not see the centre of the Milky Way, but the 
many globular clusters around the Milky Way 
are at approximately similar distances. It takes 
about 20,000 years for the light from the glob¬ 
ular cluster of Fig. 1.5 to reach the Earth. It 
takes 150,000 years to travel the distance from 
the nearest galaxy, the Magellanic Cloud seen on 
the southern sky (Fig. 1.7). The photons that we 
see now started their voyage when Neanderthal 
Man lived on the Earth. The light coming from 
the Andromeda Galaxy (Fig. 1.6) in the north¬ 
ern sky originated 2 million years ago. Around 
the same time the first actual human using tools, 
Homo habilis, appeared. The most distant objects 
known, the quasars, are so far away that their ra¬ 
diation, seen on the Earth now, was emitted long 
before the Sun or the Earth were born (Fig. 1.8). 




Spherical Astronomy 


2 


Spherical astronomy is a science studying astro¬ 
nomical coordinate frames, directions and appar¬ 
ent motions of celestial objects, determination of 
position from astronomical observations, obser¬ 
vational errors, etc. We shall concentrate mainly 
on astronomical coordinates, apparent motions of 
stars and time reckoning. Also, some of the most 
important star catalogues will be introduced. 

For simplicity we will assume that the ob¬ 
server is always on the northern hemisphere. Al¬ 
though all definitions and equations are easily 
generalised for both hemispheres, this might be 
unnecessarily confusing. In spherical astronomy 
all angles are usually expressed in degrees; we 
will also use degrees unless otherwise mentioned. 


2.1 Spherical Trigonometry 

For the coordinate transformations of spherical 
astronomy, we need some mathematical tools, 
which we present now. 

If a plane passes through the centre of a sphere, 
it will split the sphere into two identical hemi¬ 
spheres along a circle called a great circle 
(Fig. 2.1). A line perpendicular to the plane and 
passing through the centre of the sphere inter¬ 
sects the sphere at the poles P and P'. If a sphere 
is intersected by a plane not containing the cen¬ 
tre, the intersection curve is a small circle. There 
is exactly one great circle passing through two 
given points Q and Q' on a sphere (unless these 
points are antipodal, in which case all circles 
passing through both of them are great circles). 
The arc QQ' of this great circle is the shortest 


Srnall circle 



Fig. 2.1 A great circle is the intersection of a sphere and 
a plane passing through its centre. P and P' are the poles 
of the great circle. The shortest path from Q to Q' follows 
the great circle 

path on the surface of the sphere between these 
points. 

A spherical triangle is not just any three- 
cornered figure lying on a sphere; its sides must 
be arcs of great circles. The spherical trian¬ 
gle ABC in Fig. 2.2 has the arcs AB , BC and AC 
as its sides. If the radius of the sphere is r, the 
length of the arc AB is 

| AB\ = rc, [c] = rad, 

where c is the angle subtended by the arc AB 
as seen from the centre. This angle is called the 
central angle of the side AB. Because lengths of 
sides and central angles correspond to each other 
in a unique way, it is customary to give the central 
angles instead of the sides. In this way, the radius 
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of the sphere does not enter into the equations 
of spherical trigonometry. An angle of a spheri¬ 
cal triangle can be defined as the angle between 
the tangents of the two sides meeting at a vertex, 
or as the dihedral angle between the planes in¬ 
tersecting the sphere along these two sides. We 
denote the angles of a spherical triangle by cap¬ 
ital letters (A, B, C) and the opposing sides, or, 
more correctly, the corresponding central angles, 
by lowercase letters (a, b , c). 

The sum of the angles of a spherical triangle is 
always greater than 180 degrees; the excess 

E = A + B + C — 180° (2.1) 

is called the spherical excess. It is not a constant, 
but depends on the triangle. Unlike in plane ge¬ 
ometry, it is not enough to know two of the angles 
to determine the third one. The area of a spherical 
triangle is related to the spherical excess in a very 
simple way: 

Area = Er 2 , [£] = rad. (2.2) 

This shows that the spherical excess equals the 
solid angle in steradians (see Appendix A.l), sub¬ 
tended by the triangle as seen from the centre. 

To prove (2.2), we extend all sides of the tri¬ 
angle A to great circles (Fig. 2.3). These great 
circles will form another triangle A', congruent 
with A but antipodal to it. If the angle A is ex¬ 
pressed in radians, the area of the slice S(A) 



Fig. 2.2 A spherical triangle is bounded by three arcs of 
great circles, AB. BC and CA. The corresponding central 
angles are c, a, and b 


bounded by the two sides of A (the shaded 
area in Fig. 2.3) is obviously 2A/2n = A/n 
times the area of the sphere, 4nr 2 . Similarly, the 
slices S(B ) and S(C) cover fractions B/n and 
CIn of the whole sphere. 

Together, the three slices cover the whole sur¬ 
face of the sphere, the equal triangles A and A' 
belonging to every slice, and each point outside 
the triangles, to exactly one slice. Thus the area of 
the slices S(A), S(B) and S(C) equals the area of 
the sphere plus four times the area of A, .4(A): 

A + B + C , , 

- 4nr~ — 4nr~ + 4.4(A), 

n 

whence 

.4(A) — {A + B + C — n)r 2 = Er 2 . 

As in the case of plane triangles, we can de¬ 
rive relationships between the sides and angles of 
spherical triangles. The easiest way to do this is 
by inspecting certain coordinate transformations. 

Suppose we have two rectangular coordinate 
frames Oxyz and Ox’y’z' (Fig. 2.4), such that 
the x'y'z' frame is obtained from the xyz frame 
by rotating it around the x axis by an angle x • 

The position of a point P on a unit sphere is 
uniquely determined by giving two angles. The 
angle i [r is measured counterclockwise from the 
positive x axis along the xy plane; the other an¬ 
gle 6 tells the angular distance from the xy plane. 



Fig. 2.3 If the sides of a spherical triangle are extended 
all the way around the sphere, they form another trian¬ 
gle A', antipodal and equal to the original triangle A. The 
shaded area is the slice 5(A) 
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Fig. 2.4 The location of a point P on the surface of a unit 
sphere can be expressed by rectangular xyz coordinates or 
by two angles, i/r and 6. The x'y'z' frame is obtained by 
rotating the xyz frame around its x axis by an angle / 

In an analogous way, we can define the angles t/r' 
and O', which give the position of the point P in 
the x'y'z' frame. The rectangular coordinates of 
the point P as functions of these angles are: 

X = COS t/r COS0, x' — cosi/r' COS0', 

y = sin i/r cos 0, y' — sin \jr' cos O', (2.3) 

Z = sin$, z' = sin O'. 

We also know that the dashed coordinates are ob¬ 
tained from the undashed ones by a rotation in the 
yz plane (Fig. 2.5): 

x' — x, 

y' = y cos x + z sin x, (2.4) 

z! = ~y sin / + zcosx- 

By substituting the expressions of the rectangular 
coordinates (2.3) into (2.4), we have 

cos i/s' cos O’ = cos ijr cos 0 , 

sin i p-' cos 0' = sin t/r cos 0 cos x + sin 0 sin x, 

sin O' = — sin t/r cos 0 sin x + sin 0 cos x ■ 

(2.5) 

In fact, these equations are quite sufficient for 
all coordinate transformations we may encounter. 



Fig. 2.6 To derive triangulation formulas for the spher¬ 
ical triangle ABC, the spherical coordinates t/r, 0, i fr' 
and 9' of the vertex C are expressed in terms of the sides 
and angles of the triangle 

However, we shall also derive the usual equations 
for spherical triangles. To do this, we set up the 
coordinate frames in a suitable way (Fig. 2.6). 
The z axis points towards the vertex A and the 
z' axis, towards B. Now the vertex C corresponds 
to the point P in Fig. 2.4. The angles x[r, 0, \[r', O' 
and x can be expressed in terms of the angles and 
sides of the spherical triangle: 

,/r = A- 90°, 0 = 90°-b, 

y = 90°-B, O' = 90° — a, x=c. 

( 2 . 6 ) 
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Substitution into (2.5) gives 

cos(90° - B) cos(90° - a) 

= cos(A — 90°) cos(90° — b), 
sin(90° - B) cos(90° — a) 

= sin(A — 90°)cos(90° — b)cosc 
+ sin(90° — b ) sine, 
sin(90° - a) 

= — sin(A — 90°) cos(90° — b ) sine 
+ sin(90° — b) cos c, 


The second equation in (2.7) is the sine-cosine 
formula, and the corresponding plane formula is 
a trivial one: 

c = b cos A + a cos B. 

This is obtained by substituting the approxima¬ 
tions of sine and cosine into the sine-cosine for¬ 
mula and ignoring all quadratic and higher-order 
terms. In the same way we can use the third equa¬ 
tion in (2.7), the cosine formula, to derive the pla¬ 
nar cosine formula: 

a 2 — b 2 + c 2 — 2be cos A. 


01 - 

sin B sin a = sin A sin b, 

cos B sin a — — cos A sin b cos c + cos b sin c, 

cos a = cos A sin b sin c + cos b cos c. 

(2.7) 

Equations for other sides and angles are obtained 
by cyclic permutations of the sides a, b, c and the 
angles A, B, C. For instance, the first equation 
also yields 

sin C sin b = sin B sin c, 
sin A sin c — sin C sin a. 

All these variations of the sine formula can be 
written in an easily remembered form: 

sin a sin b sin c 

-=-=-. ( 2 . 8 ) 

sin A sin B sin C 

If we take the limit, letting the sides a, b 
and c shrink to zero, the spherical triangle be¬ 
comes a plane triangle. If all angles are expressed 
in radians, we have approximately 


Substituting these approximations into the sine 
formula, we get the familiar sine formula of plane 
geometry: 

a b c 

sin A sin B sin C 


2.2 The Earth 

A position on the Earth is usually given by two 
spherical coordinates (although in some calcu¬ 
lations rectangular or other coordinates may be 
more convenient). If necessary, also a third coor¬ 
dinate, e.g. the distance from the centre, can be 
used. 

The reference plane is the equatorial plane, 
perpendicular to the rotation axis and intersecting 
the surface of the Earth along the equator. Small 
circles parallel to the equator are called paral¬ 
lels of latitude. Semicircles from pole to pole 
are meridians. The geographical longitude is the 
angle between the meridian and the zero merid¬ 
ian passing through Greenwich Observatory. We 
shall use positive values for longitudes east of 
Greenwich and negative values west of Green¬ 
wich. Sign convention, however, varies, and neg¬ 
ative longitudes are not used in maps; so it is usu¬ 
ally better to say explicitly whether the longitude 
is east or west of Greenwich. 

The latitude is usually supposed to mean the 
geographical latitude, which is the angle between 
the plumb line and the equatorial plane. The lat¬ 
itude is positive in the northern hemisphere and 
negative in the southern one. The geographical 
latitude can be determined by astronomical ob¬ 
servations (Fig. 2.7): the altitude of the celestial 
pole measured from the horizon equals the geo¬ 
graphical latitude. (The celestial pole is the inter¬ 
section of the rotation axis of the Earth and the 
infinitely distant celestial sphere; we shall return 
to these concepts a little later.) 
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Celestial pole 



Fig. 2.7 The latitude <p is obtained by measuring the alti¬ 
tude of the celestial pole. The celestial pole can be imag¬ 
ined as a point at an infinite distance in the direction of the 
Earth’s rotation axis 


Because the Earth is rotating, it is slightly flat¬ 
tened. The exact shape is rather complicated, but 
for most purposes it can by approximated by an 
oblate spheroid, the short axis of which coincides 
with the rotation axis (Sect. 7.6). In 1979 the 
International Union of Geodesy and Geophysics 
(IUGG) adopted the Geodetic Reference System 
1980 (GRS-80), which is used when global ref¬ 
erence frames fixed to the Earth are defined. The 
GRS-80 reference ellipsoid has the following di¬ 
mensions: 

equatorial radius a = 6,378,137 m, 
polar radius b = 6,356,752 m, 
flattening / = (a — b)/a 

= 1/298.25722210. 

The shape defined by the surface of the oceans, 
called the geoid, differs from this spheroid at 
most by about 100 m. 

The angle between the equator and the nor¬ 
mal to the ellipsoid approximating the true Earth 
is called the geodetic latitude. Because the sur¬ 
face of a liquid (like an ocean) is perpendicular 
to the plumb line, the geodetic and geographical 
latitudes are practically the same. 



Fig. 2.8 Due to the flattening of the Earth, the geographic 
latitude (f> and geocentric latitude <f>' are different 


Because of the flattening, the plumb line does 
not point to the centre of the Earth except at the 
poles and on the equator. An angle corresponding 
to the ordinary spherical coordinate (the angle be¬ 
tween the equator and the line from the centre to 
a point on the surface), the geocentric latitude <p r 
is therefore a little smaller than the geographic 
latitude </> (Fig. 2.8). 

We now derive an equation between the geo¬ 
graphic latitude (f> and geocentric latitude <//, as¬ 
suming the Earth is an oblate spheroid and the 
geographic and geodesic latitudes are equal. The 
equation of the meridional ellipse is 



The direction of the normal to the ellipse at 
a point (x, y) is given by 

dx a 2 v 

The geocentric latitude is obtained from 
tan </>' = y/x. 


Hence 


tan<// 


y tan cj) (1 
a A 


— e 2 ) tan (p, 


(2.9) 
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where 

e = yj 1 — b 2 /a 2 

is the eccentricity of the ellipse. The difference 
A (p — <p — <t>' has a maximum 11.5' at the latitude 
45°. 

Since the coordinates of celestial bodies in as¬ 
tronomical almanacs are given with respect to the 
centre of the Earth, the coordinates of nearby ob¬ 
jects must be corrected for the difference in the 
position of the observer, if high accuracy is re¬ 
quired. This means that one has to calculate the 
topocentric coordinates, centered at the observer. 
The easiest way to do this is to use rectangular 
coordinates of the object and the observer (Ex¬ 
ample 2.6). 

One arc minute along a meridian is called 
a nautical mile. Since the radius of curvature 
varies with latitude, increasing towards the poles, 
the length of the nautical mile so defined would 
depend on the latitude (1843 m on the equator, 
1862 m at the poles). Therefore one nautical mile 
has been defined to be equal to one minute of arc 
at<p — 45°, whence 1 nautical mile = 1852 m. 


2.3 The Celestial Sphere 

The ancient universe was confined within a fi¬ 
nite spherical shell. The stars were fixed to this 
shell and thus were all equidistant from the Earth, 
which was at the centre of the spherical universe. 
This simple model is still in many ways as use¬ 
ful as it was in antiquity: it helps us to easily un¬ 
derstand the diurnal and annual motions of stars, 
and, more important, to predict these motions in 
a relatively simple way. Therefore we will assume 
for the time being that all the stars are located on 
the surface of an enormous sphere and that we 
are at its centre. Because the radius of this celes¬ 
tial sphere is practically infinite, we can neglect 
the effects due to the changing position of the ob¬ 
server, caused by the rotation and orbital motion 
of the Earth. These effects will be considered later 
in Sects. 2.9 and 2.10. 

Since the distances of the stars are ignored, 
we need only two coordinates to specify their di¬ 
rections. Each coordinate frame has some fixed 
reference plane passing through the centre of 




Fig. 2.9 (a) The apparent motions of stars during a night 
as seen from latitude <p = 45°. (b) The same stars seen 
from latitude 0=10° 

the celestial sphere and dividing the sphere into 
two hemispheres along a great circle. One of the 
coordinates indicates the angular distance from 
this reference plane. There is exactly one great 
circle going through the object and intersecting 
this plane perpendicularly; the second coordinate 
gives the angle between that point of intersection 
and some fixed direction. 


2.4 The Horizontal System 

The most natural coordinate frame from the ob¬ 
server’s point of view is the horizontal frame 
(Fig. 2.9). Its reference plane is the tangent plane 
of the Earth passing through the observer; this 
horizontal plane intersects the celestial sphere 
along the horizon. The point just above the ob¬ 
server is called the zenith and the antipodal point 
below the observer is the nadir. (These two points 
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are the poles corresponding to the horizon.) Great 
circles through the zenith are called verticals. All 
verticals intersect the horizon perpendicularly. 

By observing the motion of a star over the 
course of a night, an observer finds out that it fol¬ 
lows a track like one of those in Fig. 2.9. Stars rise 
in the east, reach their highest point, or culminate, 
on the vertical NZS, and set in the west. The ver¬ 
tical NZS is called the meridian. North and south 
directions are defined as the intersections of the 
meridian and the horizon. 

One of the horizontal coordinates is the alti¬ 
tude or elevation, a, which is measured from the 
horizon along the vertical passing through the ob¬ 
ject. The altitude lies in the range [—90°, +90°]; 
it is positive for objects above the horizon and 
negative for the objects below the horizon. The 
zenith distance, or the angle between the object 
and the zenith, is obviously 

z = 90° — a. (2.10) 

The second coordinate is the azimuth. A; it 
is the angular distance of the vertical of the ob¬ 
ject from some fixed direction. Unfortunately, 
in different contexts, different fixed directions 
are used; thus it is always advisable to check 
which definition is employed. The azimuth is 
usually measured from the north or south, and 
though clockwise is the preferred direction, coun¬ 
terclockwise measurements are also occasionally 
used. In geography, navigation and many other 
fields, the azimuth is measured from the north. 
In this book we have adopted a fairly common 
astronomical convention, measuring the azimuth 
clockwise from the south. Its values are usually 
normalised between 0° and 360° (or -180°- 
180°). The reason for this definition is that some 
other important angles are also measured from 
the south. 

In Fig. 2.9a we can see the altitude and az¬ 
imuth of three stars at some instant. As a star 
moves along its daily track, both of its coordi¬ 
nates will change. Another difficulty with this co¬ 
ordinate frame is its local character. In Fig. 2.9b 
we have the same stars, but the observer is now 
further south. We can see that the coordinates of 
the same star at the same moment are different 


for different observers. Since the horizontal co¬ 
ordinates are time and position dependent, they 
cannot be used, for instance, in star catalogues. 


2.5 The Equatorial System 

The direction of the rotation axis of the Earth re¬ 
mains almost constant and so does the equato¬ 
rial plane perpendicular to this axis. Therefore the 
equatorial plane is a suitable reference plane for 
a coordinate frame that has to be independent of 
time and the position of the observer. 

The intersection of the celestial sphere and 
the equatorial plane is a great circle, which is 
called the equator of the celestial sphere. The 
north pole of the celestial sphere is one of the 
poles corresponding to this great circle. It is also 
the point in the northern sky where the exten¬ 
sion of the Earth’s rotational axis meets the ce¬ 
lestial sphere. The celestial north pole is at a dis¬ 
tance of about one degree (which is equivalent to 
two full moons) from the moderately bright star 
Polaris. The meridian always passes through the 
north pole; it is divided by the pole into north and 
south meridians. 

The angular separation of a star from the equa¬ 
torial plane is not affected by the rotation of the 
Earth. This angle is called the declination 8. 

Stars seem to revolve around the pole once ev¬ 
ery day (Fig. 2.10). To define the second coordi¬ 
nate, we must again agree on a fixed direction, 
unaffected by the Earth’s rotation. From a mathe¬ 
matical point of view, it does not matter which 
point on the equator is selected. However, for 
later purposes, it is more appropriate to employ 
a certain point with some valuable properties, 
which will be explained in the next section. This 
point is called the vernal equinox. Because it used 
to be in the constellation Aries (the Ram), it is 
also called the first point of Aries ant denoted 
by the sign of Aries, T. Now we can define the 
second coordinate as the angle from the vernal 
equinox measured along the equator. This angle 
is the right ascension a (or R.A.) of the object, 
measured counterclockwise from T. 

Since declination and right ascension are in¬ 
dependent of the position of the observer and the 
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motions of the Earth, they can be used in star 
maps and catalogues. As will be explained later, 
in many telescopes one of the axes (the hour axis) 
is parallel to the rotation axis of the Earth. The 
other axis (declination axis) is perpendicular to 
the hour axis. Declinations can be read imme¬ 
diately on the declination dial of the telescope. 
But the zero point of the right ascension seems 
to move in the sky, due to the diurnal rotation of 
the Earth. So we cannot use the right ascension to 
find an object unless we know the direction of the 
vernal equinox. 

Since the south meridian is a well-defined line 
in the sky, we use it to establish a local coordinate 
corresponding to the right ascension. The hour 
angle h is measured clockwise from the meridian. 
The hour angle of an object is not a constant, but 
grows at a steady rate, due to the Earth’s rotation. 
The hour angle of the vernal equinox is called the 



Fig. 2.10 At night, stars seem to revolve around the ce¬ 
lestial pole. The altitude of the pole from the horizon 
equals the latitude of the observer. (Photo Aimo Sillanpaa 
and Pasi Nurmi) 


sidereal time 0. Figure 2.11 shows that for any 
object, 

0 = /; + «, ( 2 . 11 ) 

where h is the object’s hour angle and a its right 
ascension. 

Since hour angle and sidereal time change 
with time at a constant rate, it is practical to ex¬ 
press them in units of time. Also the closely re¬ 
lated right ascension is customarily given in time 
units. Thus 24 hours equals 360 degrees, 1 hour = 
15 degrees, 1 minute of time =15 minutes of arc, 
and so on. All these quantities are in the range 
[0 h, 24 h). This custom also explains the name 
hour circle ; it is a great circle along which the 
hour angle (or right ascension) is constant. 

In practice, the sidereal time can be readily de¬ 
termined by pointing the telescope to an easily 
recognisable star and reading its hour angle on 
the hour angle dial of the telescope. The right as¬ 
cension found in a catalogue is then added to the 
hour angle, giving the sidereal time at the moment 
of observation. For any other time, the sidereal 
time can be evaluated by adding the time elapsed 
since the observation. If we want to be accurate, 
we have to use a sidereal clock to measure time 
intervals. A sidereal clock runs 3 min 56.56 s fast 
a day as compared with an ordinary solar time 


Celestial pole 



Fig. 2.11 The relation between the right ascension a, 
hour angle h and sidereal time @. The right ascension is 
measured counterclockwise from the vernal equinox, the 
hour angle and sidereal time clockwise from the South 
meridian. The sidereal time is the hour angle the vernal 
equinox 
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Fig. 2.12 The nautical triangle for deriving transforma¬ 
tions between the horizontal and equatorial frames 


clock: 

24 h solar time 

= 24 h 3 min 56.56 s sidereal time. (2.12) 

The reason for this is the orbital motion of the 
Earth: stars seem to move faster than the Sun 
across the sky; hence, a sidereal clock must run 
faster. (This is further discussed in Sect. 2.13.) 

Transformations between the horizontal and 
equatorial frames are easily obtained from spher¬ 
ical trigonometry. Comparing Figs. 2.6 and 2.12, 
we find that we must make the following substi¬ 
tutions into (2.5): 

yr = 90 °-A, 9 = a, 

(2.13) 

xj/' = 90° -h, O'= 8, x — 90° — cp. 

The angle <p in the last equation is the altitude of 
the celestial pole, or the latitude of the observer. 
Making the substitutions, we get 

sin /j cos <5 = sinAcosa, 

cos h cos 8 = cos A cos a sin cp + sin a cos cp , 

sin 8 = — cos A cos a cos cp + sin a sin cp . 

(2.14) 


The inverse transformation is obtained by sub¬ 
stituting 

i/r = 90° — h, 9 = 8, 

t/r' = 90° — A, 9' = a, (2.15) 

X = —(90° — 0), 

whence 

sin A cos a = sin h cos 8, 

cos A cos a = cos h cos 8 sin cp — sin 8 cos cp, 

sin a = cos h cos 8 cos cp + sin 8 sin cp ■ 

(2.16) 

Since the altitude and declination are in the 
range [—90°, +90°], it suffices to know the sine 
of one of these angles to determine the other an¬ 
gle unambiguously. Azimuth and right ascension, 
however, can have any value from 0° to 360° (or 
from 0 h to 24 h), and to solve for them, we have 
to know both the sine and cosine to choose the 
correct quadrant. 

The altitude of an object is greatest when it 
is on the south meridian (the great circle arc be¬ 
tween the celestial poles containing the zenith). 
At that moment (called upper culmination, or 
transit ) its hour angle is 0 h. At the lower culmi¬ 
nation the hour angle is h = 12 h. When h = 0 h, 
we get from the last equation in (2.16) 

sin a = cos 8 cos cp + sin 8 sin cp 

= cos (<p — 8) = sin(90° — <p + 8). 

Thus the altitude at the upper culmination is 

90° — cp + 8, if the object culminates 
south of zenith, 

<2 max = 

90 °+cp — 8, if the object culminates 
north of zenith. 

(2.17) 

The altitude is positive for objects with 8 > 
cp — 90°. Objects with declinations less than cp — 
90° can never be seen at the latitude cp. On the 
other hand, when li= 12 h we have 

sin a = — cos 8 cos cp + sin 8 sin cp 

= — cos (8 + <p) = sin(<5 + cp — 90°), 
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and the altitude at the lower culmination is 

flmin = <5 + 0-90°. (2.18) 

Stars with <5 > 90° — </> will never set. For exam¬ 
ple, in Helsinki (</> 60°), all stars with a decli¬ 

nation higher than 30° are such circumpolar stars. 
And stars with a declination less than —30° can 
never be observed there. 

We shall now study briefly how the (a, 8) fra¬ 
me can be established by observations. Suppose 
we observe a circumpolar star at its upper and 
lower culmination (Fig. 2.13). At the upper tran¬ 
sit, its altitude is a max = 90° — (p + 8 and at the 
lower transit, « mln = 8 + <p — 90°. Eliminating the 
latitude, we get 

1 

<5 — ~{dmin T" ^rnax)- (2.19) 

Thus we get the same value for the declination, 
independent of the observer’s location. Therefore 
we can use it as one of the absolute coordinates. 
From the same observations, we can also deter¬ 
mine the direction of the celestial pole as well as 
the latitude of the observer. After these prepara¬ 
tions, we can find the declination of any object by 
measuring its distance from the pole. 

The equator can be now defined as the great 
circle all of whose points are at a distance of 90° 
from the pole. The zero point of the second co¬ 
ordinate (right ascension) can then be defined as 
the point where the Sun seems to cross the equa¬ 
tor from south to north. 

In practice the situation is more complicated, 
since the direction of Earth’s rotation axis changes 


Fig. 2.13 The altitude of a circumpolar star at upper and 
lower culmination 


due to perturbations. Therefore the equatorial co¬ 
ordinate frame is nowadays defined using cer¬ 
tain standard objects the positions of which 
are known very accurately. The best accuracy 
is achieved by using the most distant objects, 
quasars (Sect. 19.7), which remain in the same 
direction over very long intervals of time. 


2.6 Rising and Setting Times 

From the last equation (2.16), we find the hour 
angle h of an object at the moment its altitude 
is a: 

sin a 

cos h — — tan 8 tan cp -\ -. (2.20) 

cos 8 cost/) 

This equation can be used for computing rising 
and setting times. Then a = 0 and the hour an¬ 
gles corresponding to rising and setting times are 
obtained from 

cos h — — tan 8 tan (p. (2.21) 

If the right ascension a is known, we can use 
(2.11) to compute the sidereal time 0. (Later, in 
Sect. 2.14, we shall study how to transform the 
sidereal time to ordinary time.) 

If higher accuracy is needed, we have to cor¬ 
rect for the refraction of light caused by the at¬ 
mosphere of the Earth (see Sect. 2.9). In that 
case, we must use a small negative value for a 
in (2.20). This value, the horizontal refraction, is 
about —34'. 

The rising and setting times of the Sun given 
in almanacs refer to the time when the upper 
edge of the Solar disk just touches the horizon. 
To compute these times, we must set a — —50' 
(= -34' - 16')- 

Also for the Moon almanacs give rising and 
setting times of the upper edge of the disk. Since 
the distance of the Moon varies considerably, we 
cannot use any constant value for the radius of the 
Moon, but it has to be calculated separately each 
time. The Moon is also so close that its direction 
with respect to the background stars varies due to 
the rotation of the Earth. Thus the rising and set¬ 
ting times of the Moon are defined as the instants 
when the altitude of the Moon is — 34' — s + tv. 
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Fig. 2.14 The ecliptic or the apparent orbit of the Sun 
resembles somewhat a sine curve in the equatorial coordi¬ 
nate frame. The ecliptic passes through 13 constellations. 
Since many of them have been named after animals, the 
neighbourhood of the ecliptic is called the zodiac. As¬ 
trologers divide the ecliptic into 12 equal signs of 30° 

where s is the apparent radius (15.5' on the aver¬ 
age) and tv the parallax (57' on the average). The 
latter quantity is explained in Sect. 2.9. 

Finding the rising and setting times of the 
Sun, planets and especially the Moon is compli¬ 
cated by their motion with respect to the stars. 
We can use, for example, the coordinates for the 
noon to calculate estimates for the rising and set¬ 
ting times, which can then be used to interpolate 
more accurate coordinates for the rising and set¬ 
ting times. When these coordinates are used to 
compute new times a pretty good accuracy can 
be obtained. The iteration can be repeated if even 
higher precision is required. 


2.7 The Ecliptic System 

The orbital plane of the Earth, the ecliptic, is the 
reference plane of another important coordinate 
frame. The ecliptic can also be defined as the 
great circle on the celestial sphere described by 
the Sun in the course of one year. This frame is 
used mainly for planets and other bodies of the 
solar system. The orientation of the Earth’s equa¬ 
torial plane remains invariant, unaffected by an¬ 
nual motion. In spring, the Sun appears to move 
from the southern hemisphere to the northern one 
(Fig. 2.14). The time of this remarkable event 
as well as the direction to the Sun at that mo¬ 
ment are called the vernal equinox. At the vernal 


each. (Ophiuchus is unknown to them.) Due to the preces¬ 
sion of the coordinate frame the astrological signs and the 
constellations with the same name are no more aligned. 
Also the sizes of the actual constellations are quite differ¬ 
ent 



Fig. 2.15 The ecliptic geocentric (A., fi ) and heliocentric 
(A', ft') coordinates are equal only if the object is very 
far away. The geocentric coordinates depend also on the 
Earth’s position in its orbit 

equinox, the Sun’s right ascension and declina¬ 
tion are zero. The equatorial and ecliptic planes 
intersect along a straight line directed towards the 
vernal equinox. Thus we can use this direction as 
the zero point for both the equatorial and ecliptic 
coordinate frames. The point opposite the vernal 
equinox is the autumnal equinox, it is the point at 
which the Sun crosses the equator from north to 
south. 

The ecliptic latitude (3 is the angular distance 
from the ecliptic; it is in the range [—90°, +90°]. 
The other coordinate is the ecliptic longitude X, 
measured counterclockwise from the vernal 
equinox (Fig. 2.15). 

Transformation equations between the equa¬ 
torial and ecliptic frames can be derived analo- 
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gously to (2.14) and (2.16): 

sin X cos p — sin S sin s + cos S cos s sin a, 
cos A cos/J = cos <5 cos a, (2.22) 

sin /S = sin S cose — cos 5 sine sin a, 

sin a cos 8 = — sin p sin e + cos ft cos s sin X , 
cos a cos S = cos X cos ft, (2.23) 

sin S = sin p cos e + cos p sin e sin X. 

The angle e appearing in these equations is 
the obliquity of the ecliptic, or the angle between 
the equatorial and ecliptic planes. Its value is 
roughly 23° 26' (a more accurate value is given 
in Box 2.1). 

Depending on the problem to be solved, we 
may encounter heliocentric (origin at the Sun), 
geocentric (origin at the centre of the Earth) or 
topocentric (origin at the observer) coordinates. 
For very distant objects the differences are neg¬ 
ligible, but not for bodies of the solar system. 
To transform heliocentric coordinates to geocen¬ 
tric coordinates or vice versa, we must also know 
the distance of the object. This transformation is 
most easily accomplished by computing the rect¬ 
angular coordinates of the object and the new ori¬ 
gin, then changing the origin and finally evaluat¬ 
ing the new latitude and longitude from the rect¬ 
angular coordinates (see Examples 2.5 and 2.6). 


2.8 The Galactic Coordinates 

For studies of the Milky Way Galaxy, the most 
natural reference plane is the plane of the Milky 
Way (Fig. 2.16). Since the Sun lies very close 
to that plane, we can put the origin at the Sun. 
The galactic longitude l is measured counter¬ 
clockwise (like right ascension) from the direc¬ 
tion of the centre of the Milky Way (in Sagit¬ 
tarius, a = 17 h 45.7 min, S = -29°00'). The 



Celestial 


pole 

Galactic 


centre 

^ Sun 


Fig. 2.16 The galactic coordinates / and b 


galactic latitude b is measured from the galac¬ 
tic plane, positive northwards and negative south¬ 
wards. This definition was officially adopted only 
in 1959, when the direction of the galactic cen¬ 
tre was determined from radio observations ac¬ 
curately enough. The old galactic coordinates Z 1 
and b l had the intersection of the equator and the 
galactic plane as their zero point. 

The galactic coordinates can be obtained from 
the equatorial ones with the transformation equa¬ 
tions 

sin(/N — /) cos b = cos S sin(o' — ap), 
cos(/n — /) cos b = — cosS sin 3p cos (a — ap) 

+ sin <5 cos <5p, 

sin b = cos S cos <5p cos (a — ap) 

+ sin <5 sin<5p, 

(2.24) 

where the direction of the Galactic north pole is 
ap — 12 h 51.4 min, <5p = 27°08 / , and the galactic 
longitude of the celestial pole, /n = 123.0°. 


2.9 Perturbations of Coordinates 

Even if a star remains fixed with respect to the 
Sun, its coordinates can change, due to several 
disturbing effects. Naturally its altitude and az¬ 
imuth change constantly because of the rotation 
of the Earth, but even its right ascension and dec¬ 
lination are not quite free from perturbations. 

Precession The Earth is not quite spherical but 
slightly flattened. Since most of the members of 
the solar system orbit close to the ecliptic, they 
tend to pull the equatorial bulge of the Earth 
towards it. Most of this “flattening” torque is 
caused by the Moon and the Sun. But the Earth 
is rotating and therefore the torque cannot change 
the inclination of the equator relative to the eclip¬ 
tic. Instead, the rotation axis turns in a direction 
perpendicular to the axis and the torque, thus de¬ 
scribing a cone once in roughly 26,000 years. 
This slow turning of the rotation axis is called 
precession (Fig. 2.17). Because of precession, the 
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Fig. 2.17 Due to 

precession the celestial 
pole moves and makes a 
full revolution in about 
26,000 years. For a while it 
is still approaching Polaris, 
but will then begin to 
recede. In about 14,000 it 
will be close to Vega. 
Somewhat over 2000 years 
BC the pole was close to 
the a star of Drace, Thuban 



vernal equinox moves along the ecliptic clock¬ 
wise about 50 seconds of arc every year, thus in¬ 
creasing the ecliptic longitudes of all objects at 
the same rate. At present the rotation axis points 
about one degree away from Polaris, but after 
12,000 years, the celestial pole will be roughly in 
the direction of Vega. The changing ecliptic lon¬ 
gitudes also affect the right ascension and decli¬ 
nation. Thus we have to know the instant of time, 
or epoch , for which the coordinates are given. 

Currently most maps and catalogues use the 
epoch J2000.0, which means the beginning of 
the year 2000, or, to be exact, the noon of Jan¬ 
uary 1, 2000, or the Julian date 2,451,545.0 (see 
Sect. 2.15). 

Let us now derive expressions for the changes 
in right ascension and declination. Taking the last 
transformation equation in (2.23), 

sin S = cos e sin ft + sin e cos ft sin k, 

and differentiating, we get 

cos 5 d<5 = sin e cos ft cos k dk. 

Applying the second equation in (2.22) to the 
right-hand side, we have, for the change in dec¬ 


lination, 

d<5 = dk sine cos a. (2.25) 

By differentiating the equation 

cos a cos 8 = cos /3 cos k, 

we get 

—sina cos 5 da — cos a sin <S d<S 
= — cos ft sinkdk; 

and, by substituting the previously obtained ex¬ 
pression for d<5 and applying the first equation 
(2.22), we have 

sin a cos 5 da 

= dk (cos ft sin k — sin e cos 2 a sin 5) 

= dk (sin <5 sin e + cos 8 cos e sin a 

—sine cos 2 a sin 5). 

Simplifying this, we get 

da = dk (sin a sine tan 8 + cose). (2.26) 

If dk is the annual increment of the ecliptic lon¬ 
gitude (about 50"), the precessional changes in 
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Table 2.1 Precession constants m and n. Here, “a” means 
a tropical year 


Epoch 

m 

n 

1800 

3.07048 s/a 

1.33703 s/a = 

20.0554"/a 

1850 

3.07141 

1.33674 

20.0511 

1900 

3.07234 

1.33646 

20.0468 

1950 

3.07327 

1.33617 

20.0426 

2000 

3.07419 

1.33589 

20.0383 


right ascension and declination in one year are 
thus 


Ecliptic north pole 

Celestial 
pole 



d<5 = dk sine cos a, 

da = dA. (sine sin a tan S + cose). 


Fig. 2.18 Due to precession the rotation axis of the Earth 
turns around the ecliptic north pole. Nutation is the small 
' ' ' wobble disturbing the smooth precessional motion. In this 

figure the magnitude of the nutation is highly exaggerated 


These expressions are usually written in the form 


d<5 = n cos a, 

da = m + n sin a tan S , 


(2.28) 


where 


m — dAcose, 
n — dA sin e 


(2.29) 


are the precession constants. Since the obliq¬ 
uity of the ecliptic is not exactly a constant but 
changes with time, m and n also vary slowly with 
time. However, this variation is so slow that usu¬ 
ally we can regard m and n as constants unless 
the time interval is very long. The values of these 
constants for some epochs are given in Table 2.1. 
For intervals longer than a few decades a more 
rigorous method should be used. Its derivation ex¬ 
ceeds the level of this book, but the necessary for¬ 
mulas are given in Box 2.1. 


Nutation The Moon’s orbit is inclined with re¬ 
spect to the ecliptic, resulting in precession of 
its orbital plane. One revolution takes 18.6 years, 
producing perturbations with the same period in 
the precession of the Earth. This effect, nutation , 
changes ecliptic longitudes as well as the obliq¬ 
uity of the ecliptic (Fig. 2.18). Calculations are 
now much more complicated, but fortunately nu- 
tational perturbations are relatively small, only 
fractions of an arc minute. Thus they can often be 


calculated using the simple formulas of Box 2.1 
or just omitted. 

Parallax If we observe an object from different 
points, we see it in different directions. The dif¬ 
ference of the observed directions (as well as the 
whole phenomenon) is called the parallax. Since 
the amount of parallax depends on the distance 
of the observer from the object, we can utilise 
the parallax to measure distances. Human stereo¬ 
scopic vision is based (at least to some extent) 
on this effect. For astronomical purposes we need 
much longer baselines than the distance between 
our eyes (about 7 cm). Appropriately large and 
convenient baselines are the radius of the Earth 
and the radius of its orbit. 

Distances to the nearest stars can be deter¬ 
mined from the annual parallax , which is the an¬ 
gle subtended by the radius of the Earth’s orbit 
(called the astronomical unit, au) as seen from the 
star. (We shall discuss this further in Sect. 2.10.) 

By diurnal parallax we mean the change of 
direction due to the daily rotation of the Earth. In 
addition to the distance of the object, the diurnal 
parallax also depends on the latitude of the ob¬ 
server. If we talk about the parallax of a body in 
our solar system, we always mean the angle sub¬ 
tended by the Earth’s equatorial radius (6378 km) 
as seen from the object (Fig. 2.19). This equals 
the apparent shift of the object with respect to the 
background stars seen by an observer at the equa¬ 
tor if (s)he observes the object moving from the 
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Fig. 2.19 The horizontal 
parallax n of an object is 
the angle subtended by the 
Earth’s equatorial radius as 
seen from the object 




Fig. 2.20 The effect of aberration on the apparent direc¬ 
tion of an object, (a) Observer at rest, (b) Observer in mo¬ 
tion 

horizon to the zenith. The parallax of the Moon, 
for example, is about 51', and that of the Sun 
8.79". 

In astronomy parallax may also refer to dis¬ 
tance in general, even if it is not measured using 
the shift in the observed direction. 

Aberration Because of the finite speed of light, 
an observer in motion sees an object shifted in 
the direction of her/his motion (Fig. 2.20). This 
change of apparent direction is called the aber¬ 
ration. To derive the exact value we have to use 
the special theory of relativity, but for practical 
purposes it suffices to use the approximate value 

a — - sin$, [o] = rad, (2.30) 
c 



Fig. 2.21 A telescope is pointed in the true direction of 
a star. It takes a time t = l/c for the light to travel the 
length of the telescope. The telescope is moving with ve¬ 
locity v , which has a component v sin@, perpendicular to 
the direction of the light beam. The beam will hit the bot¬ 
tom of the telescope displaced from the optical axis by 
a distance x = tv sin0 = I(v/c) sin0. Thus the change of 
direction in radians is a = x/1 = (v/c) sin 9 

where v is the velocity of the observer, c is the 
speed of light and 6 is the angle between the 
true direction of the object and the velocity vector 
of the observer (Fig 2.21). The greatest possible 
value of the aberration due to the orbital motion 
of the Earth, v/c , called the aberration constant, 
is 21". The maximal shift due to the Earth’s ro¬ 
tation, the diurnal aberration constant, is much 
smaller, about 0.3". 

Refraction Since light is refracted by the at¬ 
mosphere, the direction of an object differs from 
the true direction by an amount depending on the 
atmospheric conditions along the line of sight. 
Since this refraction varies with atmospheric 
pressure and temperature, it is very difficult to 
predict it accurately. However, an approximation 
good enough for most practical purposes is eas¬ 
ily derived. If the object is not too far from the 
zenith, the atmosphere between the object and 
the observer can be approximated by a stack of 
parallel planar layers, each of which has a certain 
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index of refraction n, (Fig. 2.22). Outside the at¬ 
mosphere, we have n — 1. 

Let the true zenith distance be z and the appar¬ 
ent one, £. Using the notations of Fig. 2.22, we 
obtain the following equations for the boundaries 
of the successive layers: 

sin; = Hk sin Zk, 


ri 2 sinz 2 = «i sin;i, 

«i sin ;i = no sin f. 

Simplifying this set of equations we get 

sin; = «osin£. (2.31) 

When the refraction angle R = z — £ is small and 
is expressed in radians, we have 

zto sin 4T = sin; = sin(/? + £) 

= sin R cos f + cos R sin £ 

~ Rcost; + sin£. 

Thus we get 

R = (no — 1) tan£, [/?] = rad. (2.32) 

The index of refraction depends on the density 
of the air, which further depends on the pressure 



Fig. 2.22 Refraction of a light ray travelling through the 
atmosphere. If the altitude is not very low we can think 
that the atmosphere consists of planar layers each of which 
has its own index of refraction. Outside the atmosphere the 
index is n = 1 and increases towards the ground 


and temperature. When the altitude is over 15°, 
we can use an approximate formula 

R =---0.00452° tan(90° - a), (2.33) 

273 +T 

where a is the altitude in degrees, T tempera¬ 
ture in degrees Celsius, and P the atmospheric 
pressure in hectopascals (or, equivalently, in mil¬ 
libars). At lower altitudes the curvature of the at¬ 
mosphere must be taken into account. An approx¬ 
imate formula for the refraction is then 

P 0.1594 + 0.0196a + 0.00002a 2 

“ 273 + T 1 + 0.505a + 0.0845a 2 ' 

(2.34) 

These formulas are widely used, although they 
are against the rules of dimensional analysis. 
To get correct values, all quantities must be ex¬ 
pressed in correct units. Figure 2.23 shows the 
refraction under different conditions evaluated 
from these formulas. 

Altitude is always (except very close to zenith) 
increased by refraction. On the horizon the change 
is about 34', which is slightly more than the di¬ 
ameter of the Sun. When the lower limb of the 
Sun just touches the horizon, the Sun has in real¬ 
ity already set. 

Light coming from the zenith is not refracted 
at all if the boundaries between the layers are hor¬ 
izontal. Under some climatic conditions, a bound¬ 
ary (e.g. between cold and warm layers) can be 
slanted, and in this case, there can be a small 



Fig. 2.23 Refraction at different altitudes. The refraction 
angle R tells how much higher the object seems to be com¬ 
pared with its true altitude a. Refraction depends on the 
density and thus on the pressure and temperature of the 
air. The upper curves give the refraction at sea level dur¬ 
ing rather extreme weather conditions. At the altitude of 
2.5 kilometers the average pressure is only 700 hPa, and 
thus the effect of refraction smaller (lowest curve ) 
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zenith refraction, which is of the order of a few 
arc seconds. 

Stellar positions given in star catalogues are 
mean places , from which the effects of parallax, 
aberration and nutation have been removed. The 
mean place of the date (i.e. at the observing time) 
is obtained by correcting the mean place for the 
proper motion of the star (Sect. 2.10) and preces¬ 
sion. The apparent place is obtained by correct¬ 
ing this place further for nutation, parallax and 
aberration. There is a catalogue published annu¬ 
ally that gives the apparent places of certain ref¬ 
erences stars at intervals of a few days. These po¬ 
sitions have been corrected for precession, nuta¬ 
tion, parallax and annual aberration. The effects 
of diurnal aberration and refraction are not in¬ 
cluded because they depend on the location of the 
observer. 


2.10 Positional Astronomy 

The position of a star can be measured either with 
respect to some reference stars (relative astrome¬ 
try) or with respect to a fixed coordinate frame 
(absolute astrometry). 

Absolute coordinates are usually determined 
using a meridian circle, which is a telescope 
that can be turned only in the meridional plane 
(Fig. 2.27). It has only one axis, which is aligned 
exactly in the east-west direction. Since all stars 
cross the meridian in the course of a day, they all 
come to the field of the meridian circle at some 
time or other. When a star culminates, its altitude 
and the time of the transit are recorded. If the time 
is determined with a sidereal clock, the sidereal 
time immediately gives the right ascension of the 
star, since the hour angle is h — 0 h. The other 
coordinate, the declination <5, is obtained from the 
altitude: 

S = a — (90° — 0), 

where a is the observed altitude and cf> is the geo¬ 
graphic latitude of the observatory. 

Relative coordinates are measured on photo¬ 
graphic plates (Fig. 2.25) or CCD images con¬ 
taining some known reference stars. The scale of 
the plate as well as the orientation of the coordi- 



Fig. 2.24 The trigonometric parallax n of a star S is the 
angle subtended by the radius of the orbit of the Earth, or 
one astronomical unit, as seen from the star 

nate frame can be determined from the reference 
stars. After this has been done, the right ascen¬ 
sion and declination of any object in the image 
can be calculated if its coordinates in the image 
are measured. 

All stars in a small field are almost equally af¬ 
fected by the dominant perturbations, precession, 
nutation, and aberration. The much smaller effect 
of parallax, on the other hand, changes the rela¬ 
tive positions of the stars. 

The shift in the direction of a star with respect 
to distant background stars due to the annual mo¬ 
tion of the Earth is called the trigonometric par¬ 
allax of the star. It gives the distance of the star: 
the smaller the parallax, the farther away the star 
is. Trigonometric parallax is, in fact, the only di¬ 
rect method we currently have of measuring dis¬ 
tances to stars. Later we shall be introduced to 
some other, indirect methods, which require cer¬ 
tain assumptions on the motions or structure of 
stars. The same method of triangulation is em¬ 
ployed to measure distances of earthly objects. 
To measure distances to stars, we have to use the 
longest baseline available, the diameter of the or¬ 
bit of the Earth. 

During the course of one year, a star will ap¬ 
pear to describe a circle if it is at the pole of the 
ecliptic, a segment of line if it is in the ecliptic, or 
an ellipse otherwise. The semimajor axis of this 
ellipse is called the parallax of the star. It is usu¬ 
ally denoted by 7r. It equals the angle subtended 
by the radius of the Earth’s orbit as seen from the 
star (Fig. 2.24). 
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The unit of distance used in astronomy is par¬ 
sec (pc). At a distance of one parsec, one astro¬ 
nomical unit subtends an angle of one arc sec¬ 
ond. Since one radian is about 206,265", 1 pc 
equals 206,265 au. Furthermore, because 1 au = 
1.496 x 10 11 m, 1 pc « 3.086 x 10 16 m. If the 
parallax is given in arc seconds, the distance is 
simply 


r — l/;r, [r]=pc, [t r]=". (2.35) 

In popular astronomical texts, distances are 
usually given in light-years, one light-year be¬ 
ing the distance light travels in one year, or 9.5 x 
10 15 m. Thus one parsec is about 3.26 light-years. 

The first parallax measurement was accom¬ 
plished by Friedrich Wilhelm Bessel (1784-1846) 
in 1838. He found the parallax of 61 Cygni to be 
0.3". The nearest star Proxima Centauri has a par¬ 
allax of 0.762" and thus a distance of 1.31 pc. 

In addition to the motion due to the annual par¬ 
allax, many stars seem to move slowly in a direc¬ 
tion that does not change with time. This effect is 
caused by the relative motion of the Sun and the 
stars through space; it is called the proper motion. 
The appearance of the sky and the shapes of the 
constellations are constantly, although extremely 
slowly, changed by the proper motions of the stars 
(Fig. 2.26). 

The velocity of a star with respect to the Sun 
can be divided into two components (Fig. 2.28), 
one of which is directed along the line of sight 
(the radial component or the radial velocity), and 
the other perpendicular to it (the tangential com¬ 
ponent). The tangential velocity results in the 
proper motion, which can be measured by taking 
pictures at intervals of several years or decades. 
The proper motion pt has two components, one 
giving the change in declination pts and the other, 
in right ascension, pt a cos <5 . The coefficient cos 5 
is used to correct the scale of right ascension: 
hour circles (the great circles with a — constant) 
approach each other towards the poles, so the co¬ 
ordinate difference must be multiplied by cos <5 
to obtain the true angular separation. The total 
proper motion is 


The greatest known proper motion belongs to 
Barnard’s Star, which moves across the sky at the 
enormous speed of 10.3 arc seconds per year. It 
needs less than 200 years to travel the diameter 
of a full moon. 

In order to measure proper motions, we must 
observe stars for a long time. The radial compo¬ 
nent, on the other hand, is readily obtained from 
a single observation, thanks to the Doppler effect. 
By the Doppler effect we mean the change in fre¬ 
quency and wavelength of radiation due to the ra¬ 
dial velocity of the radiation source. The same ef¬ 
fect can be observed, for example, in the sound 
of an ambulance, the pitch being higher when the 
ambulance is approaching and lower when it is 
receding. 

The formula for the Doppler effect for small 
velocities can be derived as in Fig. 2.29. The 
source of radiation transmits electromagnetic 
waves, the period of one cycle being T. In time T, 
the radiation approaches the observer by a dis¬ 
tance s = cT, where c is the speed of propa¬ 
gation. During the same time, the source moves 
with respect to the observer a distance s' = vT, 
where v is the speed of the source, positive for 
a receding source and negative for an approach¬ 
ing one. We find that the length of one cycle, the 
wavelength X, equals 

X = s + s' = cT + vT. 


If the source were at rest, the wavelength of its 
radiation would be ko = cT. The motion of the 
source changes the wavelength by an amount 

AX = X — Xq = cT + vT — cT — vT, 


and the relative change AX of the wavelength is 


AX _v 
Xo c' 


(2.37) 


This is valid only when v <5IC c. For very high ve¬ 
locities, we must use the relativistic formula 


AX 1 1 + v/c 
Xo y 1 - v/c 


(2.38) 


M = JB-l cos 2 S + p, 2 s . 


These formulas are valid only for electromag- 

(2.36) 

netic radiation. The Doppler shift of e.g. sound 






2.10 Positional Astronomy 


29 


» 2 3 4 5 



i 


i? i 

A 

1 

is i 

b t; ii i 

?o ?i n ?3 ? 

n ?b 


















09' 




















ff» 





















#f* 




















ft* 




















ft* 




















Si* 





















«• 

** 






















n ♦ 















































. 























#* 





















































<?♦ 

ai 























0* 



























1 





















#- 

** 






















u 























*- 

«■> 






















n 

* 






















•c 






















*• 






















•>• 




















ft® 

















If 
































* #»* >f H * tf #*• 

ti • 

H- 


a- 

u • 

f • 

#• 

0- 

#• 

$*- 

0r 

a 

« 


M 

«- 

st- 

#»• 


+7\ 

9<kuJ 



Fig. 2.25 (a) A plate photographed for the Carte du Ciel 
project in Helsinki on November 21, 1902. The centre of 
the field is at a = 18 h 40 min, S = 46°, and the area is 
2° x 2°. Distance between coordinate lines (exposed sep¬ 
arately on the plate) is 5 minutes of arc. (b) The framed 
region on the same plate, (c) The same area on a plate 


taken on November 7, 1948. The bright star in the lower 
right comer (SAO 47747) has moved about 12 seconds 
of arc. The brighter, slightly drop-shaped star to the left 
is a binary star (SAO 47767); the separation between its 
components is 8" 
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Fig. 2.26 Proper motions of stars slowly change the appearance of constellations, (a) The Big Dipper during the last 
ice age 30,000 years ago, (b) nowadays, and (c) after 30,000 years 



Fig. 2.27 Astronomers discussing observations with the 
transit circle of Helsinki Observatory in 1904 

waves is different depending on the motion of the 
source or the observer. 

In astronomy the Doppler effect can be seen 
in stellar spectra, in which the spectral lines are 
often displaced towards the blue (shorter wave¬ 
lengths) or red (longer wavelengths) end of the 
spectrum. A blueshift means that the star is ap¬ 
proaching, while a redshift indicates that it is re¬ 
ceding. 

The displacements due to the Doppler effect 
are usually very small. In order to measure them, 
a reference spectrum is exposed on the plate next 
to the stellar spectrum. Now that CCD-cameras 
have replaced photographic plates, separate cal¬ 



Fig. 2.28 The radial and tangential components, v r and 
n t of the velocity v of a star. The latter component is ob¬ 
served as proper motion 



Fig. 2.29 The wavelength of radiation increases if the 
source is receding 

ibration exposures of reference spectra are taken 
to determine the wavelength scale (Sect. 3.3). The 
lines in the reference spectrum are produced by 
a light source at rest in the laboratory. If the ref¬ 
erence spectrum contains some lines found also 
in the stellar spectrum, the displacements can be 
measured. 

Also the spectra of distant galaxies show red- 
shifts, but only a part of that is due to the Doppler 
effect. Instead, the redshifts are caused by the ex¬ 
pansion of the Universe (Chap. 20). Also, gravi¬ 
tational fields cause redshifts as predicted by the 
general relativity (Appendix B). 

Displacements of spectral lines give the ra¬ 
dial velocity v r of the star, and the proper mo¬ 
tion ijl can be measured from photographic plates 
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or CCD images. To find the tangential velocity v t , 
we have to know the distance r, obtainable from 
e.g. parallax measurements. Tangential velocity 
and proper motion are related by 

t't = jtir. (2.39) 

If /x is given in arc seconds per year and r in 
parsecs we have to make the following unit trans¬ 
formations to get u t in km/s: 

1 rad = 206,265", 1 year = 3.156 x 10 7 s, 

1 pc = 3.086 x 10 13 km. 

Hence 

u t = 4.74 fir, [i> t ] = km/s, 

(2.40) 

[/x]=7a, [r ] = pc. 

The total velocity v of the star is then 

v = + u t 2 . (2.41) 


2.11 Constellations 

At any one time, about 1000-1500 stars can be 
seen in the sky (above the horizon). Under ideal 
conditions, the number of stars visible to the 
naked eye can be as high as 3000 on a hemi¬ 
sphere, or 6000 altogether. Some stars seem to 
form figures vaguely resembling something; for 
ages they have been ascribed to various mytho¬ 
logical and other animals. This grouping of stars 
into constellations is a product of human imagi¬ 
nation without any physical basis. Different cul¬ 
tures have different constellations, depending on 
their mythology, history and environment. 

About half of the shapes and names of the 
constellations we are familiar with date back 
to Mediterranean antiquity. But the names and 
boundaries were far from unambiguous as late as 
the 19th century. Therefore the International As¬ 
tronomical Union (IAU) confirmed fixed bound¬ 
aries at its 1928 meeting. 

The official boundaries of the constellations 
were established along lines of constant right as¬ 
cension and declination for the epoch 1875. Dur¬ 
ing the time elapsed since then, precession has 


noticeably turned the equatorial frame. However, 
the boundaries remain fixed with respect to the 
stars. So a star belonging to a constellation will 
belong to it forever (unless it is moved across the 
boundary by its proper motion). 

The names of the 88 constellations confirmed 
by the IAU are given in Table C.21 at the end of 
the book. The table also gives the abbreviation of 
the Latin name, its genitive (needed for names of 
stars) and the English name. 

In his star atlas Uranometria (1603) Johannes 
Bayer started the current practice to denote the 
brightest stars of each constellation by Greek let¬ 
ters. The brightest star is usually a (alpha), e.g. 
Deneb in the constellation Cygnus is a Cygni, 
which is abbreviated as a Cyg. The second bright¬ 
est star is fi (beta), the next one y (gamma) and 
so on. There are, however, several exceptions to 
this rule; for example, the stars of the Big Dipper 
are named in the order they appear in the con¬ 
stellation. After the Greek alphabet has been ex¬ 
hausted, Latin letters can be employed. Another 
method is to use numbers, which are assigned 
in the order of increasing right ascension; e.g. 
30 Tau is a bright binary star in the constellation 
Taurus. These numbers are based on the star cata¬ 
logue Historia Ccelestis Britannica (1725) by the 
first Astronomer Royal, John Flamsteed. More¬ 
over, variable stars have their special identifiers 
(Sect. 13.1). About two hundred bright stars have 
a proper name; e.g. the bright a Aur is called also 
Capella. 

As telescopes evolved, more and more stars 
were seen and catalogued. It soon became im¬ 
practical to continue this method of naming. 
Thus most of the stars are known only by their 
catalogue index numbers. One star may have 
many different numbers; e.g. the abovementioned 
Capella (a Aur) is number BD+45° 1077 in the 
Bonner Durchmusterung and HD 34029 in the 
Henry Draper catalogue. 


2.12 Star Catalogues and Maps 

The first actual star catalogue was published by 
Ptolemy in the second century; this catalogue 
appeared in the book to be known later as Al¬ 
magest (which is a Latin corruption of the name 
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of the Arabic translation, Al-mijisti). It had 1025 
entries; the positions of these bright stars had 
been measured by Hipparchos 250 years earlier. 
Ptolemy’s catalogue was the only widely used 
one prior to the 17th century. 

The first catalogues still being used by as¬ 
tronomers were prepared under the direction of 
Friedrich Wilhelm August Argelander (1799— 
1875). Argelander worked in Turku and later 
served as professor of astronomy in Helsinki, but 
he made his major contributions in Bonn. Using 
a 72 mm telescope, he and his assistants mea¬ 
sured the positions and estimated the magnitudes 
of 320,000 stars. The catalogue, Bonner Durch- 
musterung, contains nearly all stars brighter than 
magnitude 9.5 between the north pole and dec¬ 
lination —2°. (Magnitudes are further discussed 
in Chap. 4.) Argelander’s work was later used as 
a model for two other large catalogues covering 
the whole sky. The total number of stars in these 
catalogues is close to one million (Fig. 2.30). 

The purpose of these Durchmusterungen or 
general catalogues was to systematically list 
a great number of stars. In the zone catalogues, 
the main goal is to give the positions of stars 
as exactly as possible. A typical zone cata¬ 
logue is the German Katalog der Astronomischen 
Gesellschaft (AGK). Twelve observatories, each 
measuring a certain region in the sky, contributed 
to this catalogue. The work was begun in the 
1870’s and completed at the turn of the century. 

General and zone catalogues were based on vi¬ 
sual observations with a telescope. The evolution 
of photography made this kind of work unneces¬ 
sary at the end of the 19th century. Photographic 
plates could be stored for future purposes, and 
measuring the positions of stars became easier 
and faster, making it possible to measure many 
more stars. 

A great international program was started at 
the end of the 19th century in order to photograph 
the entire sky. Eighteen observatories participated 
in this Carte clu Ciel project, all using similar in¬ 
struments and plates. The positions of stars were 
first measured with respect to a rectangular grid 
exposed on each plate (Fig. 2.25a). These coor¬ 
dinates could then be converted into declination 
and right ascension. 


The old plates have been useful also much 
later. By observing the same areas again the 
proper motions of many stars can be measured. 

Positions of stars in catalogues are measured 
with respect to certain comparison stars, the co¬ 
ordinates of which are known with high accuracy. 
The coordinates of these reference stars are pub¬ 
lished in fundamental catalogues. The first such 
catalogue was needed for the AGK catalogue; it 
was published in Germany in 1879. This Funda¬ 
mental Katalog (FK 1) gives the positions of over 
500 stars. 

A widely used catalogue is the SAO cata¬ 
logue, published by the Smithsonian Astrophysi- 
cal Observatory in the 1960’s. It contains the ex¬ 
act positions, magnitudes, proper motions, spec¬ 
tral classifications, etc. of 258,997 stars brighter 
than magnitude 9. The catalogue was accompa¬ 
nied by a star map containing all the stars in the 
catalogue. 

In the 1990’s a large astrometric catalogue, 
PPM (Positions and Proper Motions), was pub¬ 
lished to replace the AGK and SAO catalogues. It 
contained all stars brighter than 7.5 magnitudes, 
and was almost complete to magnitude 8.5. Al¬ 
together, the four volumes of the catalogue con¬ 
tained information on 378,910 stars. 

The PPM was effectively replaced by the Ty¬ 
cho catalogue from Hipparcos satellite. Hippar- 
cos was the first astrometric satellite, and was 
launched by the European Space Agency (ESA) 
in 1989. Although Hipparcos didn’t reach the 
planned geosynchronous orbit, it gave exact posi¬ 
tions of over a hundred thousand stars. The Hip¬ 
parcos catalogue, based on the measurements of 
the satellite, contains astrometric and photomet¬ 
ric data of 118,000 stars. The coordinates are pre¬ 
cise to a couple of milliarcseconds. The less pre¬ 
cise Tycho catalogue contains the data of about 
one million stars. 

In 1999 and 2000, the sixth version of the 
Fundamental Katalog, the FK6, was published. 
It combined the Hipparcos data and FK5 for 
4150 fundamental stars. The typical mean error in 
proper motion was 0.35 milliarcseconds per year 
for the basic stars. With the advance of the Inter¬ 
net, the printed versions of star catalogues were 
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a) 



Fig. 2.30 The representations in four atlases of the 
Hyades cluster in the constellation Taurus, (a) Heis: 
Atlas Coelestis, published in 1872. (b) Bonner Durch- 
musterung. (c) SAO, (d) Palomar Sky Atlas, red plate. The 
big blob is the brightest star of Taurus, or a Tauri alias 


Aldebaran. (e) All the stars in the Tycho Catalog, num¬ 
bering over one million, are marked on an all-sky chart. 
The bright lane is the Milky Way. (Picture David Seal, 
NASA/JPL/Caltech) 


discontinued in the first years of the new millen¬ 
nium, and the catalogues were moved to the In¬ 
ternet. 


In the current ICRS system (International Co¬ 
ordinate Reference System) the celestial coordi¬ 
nates are fixed to distant quasars, the positions 
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of which remain pretty constant with time. The 
coordinates of the epoch J2000.0 correspond to 
the coordinates of this fundamental system quite 
closely. 

With the new media, the size of the star cat¬ 
alogues exploded. The first Hubble Guide Star 
Catalog from the early 1990’s contained 18 mil¬ 
lion stars and the second Guide Star Catalog 
from the year 2001, nearly 500 million stars. It 
was surpassed by the U.S. Naval Observatory 
USNO-Bl.O Catalog, which contains entries for 
1,024,618,261 stars and galaxies from digitised 
images of several photographic sky surveys. The 
catalogue presents right ascension and declina¬ 
tion, proper motion and magnitude estimates. 

The next step in the accuracy of astrometry 
will be achieved in the 2010’s with a new Eu¬ 
ropean astrometric satellite. The Gaia satellite 
launched in 2013 has improved the accuracy to 
about 10 -5 seconds of arc. 

The importance of astrometry is not limited 
to star maps. Astrometric observations give ba¬ 
sic information with deeper physical content also. 
When the distance of a star is known, its true radi¬ 
ation power can be calculated from the observed 
brightness (Chap. 4), giving information about 
the structure and evolution of the star (Chaps. 11 
and 12). Motions of stars are related e.g. to the 
mass distribution of the Milky Way (Chap. 18). 

Star maps have been published since ancient 
times, but the earliest maps were globes show¬ 
ing the celestial sphere as seen from the outside. 
At the beginning of the 17th century, a German, 
Johannes Bayer, published the first map show¬ 
ing the stars as seen from inside the celestial 
sphere, as we see them in the sky. Constellations 
were usually decorated with drawings of mytho¬ 
logical figures. The Uranometria Nova (1843) by 
Argelander represents a transition towards mod¬ 
em maps: mythological figures are beginning to 
fade away. The map accompanying the Bonner 
Durchmusterung carried this evolution to its ex¬ 
treme. The sheets contain nothing but stars and 
coordinate lines. 

Most maps are based on star catalogues. Pho¬ 
tography made it possible to produce star maps 
without the cataloguing stage. The most impor¬ 
tant of such maps is a photographic atlas the 


full name of which is The National Geographic 
Society—Palomar Observatory Sky Atlas. The 
plates for this atlas were taken with the 1.2 m 
Schmidt camera on Mount Palomar. The Palomar 
Sky Atlas was completed in the 1950’s. It con¬ 
sists of 935 pairs of photographs: each region has 
been photographed in red and blue light. The size 
of each plate is about 35 cm x 35 cm, covering 
an area of 6.6° x 6.6°. The prints are negatives 
(black stars on a light background), because in 
this way, fainter objects are visible. The limiting 
magnitude is about 19 in blue and 20 in red. 

The Palomar atlas covers the sky down to 
—30°. Work to map the rest of the sky was car¬ 
ried out later at two observatories in the south¬ 
ern hemisphere, at Siding Spring Observatory in 
Australia, and at the European Southern Obser¬ 
vatory (ESO) in Chile. The instruments and the 
scale on the plates are similar to those used earlier 
for the Palomar plates, but the atlas is distributed 
on film transparencies instead of paper prints. 

For amateurs there are several star maps of 
various kinds. Some of them are mentioned in the 
references. 

Several star catalogues can nowadays copied 
freely from the Internet. Thus anyone with suit¬ 
able programming skills can make star maps ap¬ 
propriate for her own purposes. 


2.13 Sidereal and Solar Time 

Time measurements can be based on the rotation 
of the Earth, orbital motion around the Sun, or 
on atomic clocks. The last-mentioned will be dis¬ 
cussed in the next section. Here we consider the 
sidereal and solar times related to the rotation of 
the Earth. 

We defined the sidereal time as the hour an¬ 
gle of the vernal equinox. A good basic unit is a 
sidereal day , which is the time between two suc¬ 
cessive upper culminations of the vernal equinox. 
After one sidereal day the celestial sphere with 
all its stars has returned to its original position 
with respect to the observer. The flow of sidereal 
time is as constant as the rotation of the Earth. 
The rotation rate is slowly decreasing, and thus 
the length of the sidereal day is increasing. In ad¬ 
dition to the smooth slowing down irregular vari- 
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Fig. 2.31 One sidereal day is the time between two 
successive transits or upper culminations of the vernal 
equinox. By the time the Earth has moved from A to B, 
one sidereal day has elapsed. The angle A is greatly exag¬ 
gerated; in reality, it is slightly less than one degree 


ations of the order of one millisecond have been 
observed. 

Unfortunately, also the sidereal time comes in 
two varieties, apparent and mean. The apparent 
sidereal time is determined by the true vernal 
equinox, and so it is obtained directly from ob¬ 
servations. 

Because of the precession the ecliptic longi¬ 
tude of the vernal equinox increases by about 
50" a year. This motion is very smooth. Nutation 
causes more complicated wobbling. The mean 
equinox is the point where the vernal equinox 
would be if there were no nutation. The mean 
sidereal time is the hour angle of this mean 
equinox. 

The difference of the apparent and mean side¬ 
real time is called the equation of equinoxes: 

©a — @m = Ai/ccose, (2.42) 

where e is the obliquity of the ecliptic at the in¬ 
stant of the observation, and Ai//, the nutation in 
longitude. This value is tabulated for each day 
e.g. in the Astronomical Almanac. It can also be 
computed from the formulae given in Box 2.1. It 
is at most about one second, so it has to be taken 
into account only in the most precise calculations. 

Figure 2.31 shows the Sun and the Earth at 
vernal equinox. When the Earth is at the point A , 
the Sun culminates and, at the same time, a new 
sidereal day begins in the city with the huge black 
arrow standing in its central square. After one 


sidereal day, the Earth has moved along its orbit 
almost one degree of arc to the point B. There¬ 
fore the Earth has to turn almost a degree further 
before the Sun will culminate. The solar or syn¬ 
odic day is therefore 3 min 56.56 s (sidereal time) 
longer than the sidereal day. This means that the 
beginning of the sidereal day will move around 
the clock during the course of one year. After 
one year, sidereal and solar time will again be in 
phase. The number of sidereal days in one year is 
one higher than the number of solar days. 

When we talk about rotation periods of plan¬ 
ets, we usually mean sidereal periods. The length 
of day, on the other hand, means the rotation pe¬ 
riod with respect to the Sun. If the orbital period 
around the Sun is P, sidereal rotation period r* 
and synodic day r, we now know that the num¬ 
ber of sidereal days in time P, P/ t* , is one higher 
than the number of synodic days, P/z: 


P 

T* 


P 

r 


= 1, 


or 


1 


1 


r r* 


1 

~P' 


(2.43) 


This holds for a planet rotating in the direction of 
its orbital motion (counterclockwise). If the sense 
of rotation is opposite, or retrograde, the number 
of sidereal days in one orbital period is one less 
than the number of synodic days, and the equa¬ 
tion becomes 


1 1 1 

- = - + -■ (2.44) 

r r* P 

For the Earth, we have P — 365.2564 d, and 
r = ld, whence (2.43) gives r* = 0.99727 d = 
23 h 56 min 4 s, solar time. 

Since our everyday life follows the alternation 
of day and night, it is more convenient to base our 
timekeeping on the apparent motion of the Sun 
rather than that of the stars. Unfortunately, the so¬ 
lar time does not flow at a constant rate. There are 
two reasons for this. First, the orbit of the Earth is 
not exactly circular, but an ellipse, which means 
that the velocity of the Earth along its orbit is 
not constant. Second, the Sun moves along the 
ecliptic, not the equator. Thus its right ascension 
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does not increase at a constant rate. The change 
is fastest at the end of December (4 min 27 s per 
day) and slowest in mid-September (3 min 35 s 
per day). As a consequence, the hour angle of the 
Sun (which determines the solar time) also grows 
at an uneven rate. 

To find a solar time flowing at a constant rate, 
we define a fictitious mean sun, which moves 
along the celestial equator with constant angu¬ 
lar velocity, making a complete revolution in one 
year. By year we mean here the tropical year, 
which is the time it takes for the Sun to move 
from one vernal equinox to the next. In one trop¬ 
ical year, the right ascension of the Sun increases 
exactly 24 hours. The length of the tropical year 
is 365 d 5 h 48 min 46 s = 365.2422 d. Since the 
direction of the vernal equinox moves due to pre¬ 
cession, the tropical year differs from the sidereal 
year, during which the Sun makes one revolution 
with respect to the background stars. One sidereal 
year is 365.2564 d. 

Using our artificial mean sun, we now define 
an evenly flowing solar time, the mean solar time 
(or simply mean time) 7m, which is equal to the 
hour angle /im of the centre of the mean sun plus 
12 hours (so that the date will change at midnight, 
to annoy astronomers): 

7m = Am + 12 h. (2.45) 

The difference between the true solar time T 
and the mean time 7 m is called the equation of 
time: 

E.T. = T - T m . (2.46) 

(In spite of the identical abbreviation, this has 
nothing to do with a certain species of little green 
men.) The greatest positive value of E.T. is about 
16 minutes and the greatest negative value about 
—14 minutes (see Fig. 2.32). This is also the dif¬ 
ference between the true noon (the meridian tran¬ 
sit of the Sun) and the mean noon. 

Both the true solar time and mean time are 
local times, depending on the hour angle of the 
Sun, real or artificial. If one observes the true so¬ 
lar time by direct measurement and computes the 
mean time from (2.46), a digital watch will prob¬ 
ably be found to disagree with both of them. The 
reason for this is that we do not use local time in 



Fig. 2.32 Equation of time. A sundial always shows (if 
correctly installed) true local solar time. To find the local 
mean time the equation of time must be subtracted from 
the local solar time 

our everyday life; instead we use the zonal time 
of the nearest time zone. 

In the past, each city had its own local time. 
When travelling became faster and more popu¬ 
lar, the great variety of local times became an in¬ 
convenience. At the end of the 19th century, the 
Earth was divided into 24 zones, the time of each 
zone differing from the neighbouring ones by one 
hour. On the surface of the Earth, one hour in time 
corresponds to 15° in longitude; the time of each 
zone is determined by the local mean time at one 
of the longitudes 0°, 15°,_345°. 

The time of the zero meridian going through 
Greenwich is used as an international reference, 
Universal Time. In most European countries, time 
is one hour ahead of this (Fig. 2.33). 

In summer, many countries switch to daylight 
saving tune, during which time is one hour ahead 
of the ordinary time. The purpose of this is to 
make the time when people are awake coincide 
with daytime in order to save electricity, particu¬ 
larly in the evening, when people go to bed one 
hour earlier. During daylight saving time, the dif¬ 
ference between the true solar time and the offi¬ 
cial time can grow even larger. It has also been 
criticised that the disadvantages of the daylight 
saving time are much more severe than its profits 
in energy saving. Many people have serious diffi¬ 
culties to adapt to a different daily rhythm twice 
every year. Also, confusingly, different countries 
use different rules for the daylight saving time. 

In the EU countries the daylight saving time 
begins on the last Sunday of March, at 1 o’clock 
UTC in the morning, when the clocks are moved 
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WORLD MAP OF TIME ZONES 



Fig. 2 .33 The time zones. The map gives the difference of the local zonal time from the Greenwich mean time (UT). During daylight saving time, one hour must be added to the 
given figures. When travelling across the date line westward, the date must be incremented by one day, and decremented if going eastward. For example, a traveller taking a flight 
from Honolulu to Tokyo on Monday morning will arrive on Tuesday, even though (s)he does not see a single night en route. In 2015 North Korea adopted time zone UTC+8.5 
h, which is half an hour behind the South Korean time. (Drawing U.S. Naval Observatory) 
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forward by one hour, and ends on the last Sunday 
of October at 1 o’clock UTC. 


2.14 Astronomical Time Systems 


Time can be defined using several different phe¬ 
nomena: 

1. The solar and sidereal times are based on the 
rotation of the Earth. 

2. The standard unit of time in the current SI sys¬ 
tem, the second, is based on quantum mechan¬ 
ical atomary phenomena. 

3. Equations of physics like the ones describing 
the motions of celestial bodies involve a time 
variable corresponding to an ideal time run¬ 
ning at a constant pace. The ephemeris time 
and dynamical time discussed a little later are 
such times. 

Observations give directly the apparent side¬ 
real time as the hour angle of the true vernal 
equinox. From the apparent sidereal time the 
mean sidereal time can be calculated. 

The universal time UT is defined by the equa¬ 
tion 


GMST(0 UT) = 24,110.54841 s 

+ T x 8,640,184.812866 s 
+ T 2 x 0.093104 s 
- T 3 x 0.0000062 s, 

(2.47) 

where GMST is the Greenwich mean sidereal 
time and T the Julian century. The latter is ob¬ 
tained from the Julian date J, which is a running 
number of the day (Sects. 2.15 and Box 2.2): 


/ — 2,451,545.0 
36,525 


(2.48) 


This gives the time elapsed since January 1, 2000, 
in Julian centuries. 

Sidereal time and hence also UT are related 
to the rotation of the Earth, and thus contain per¬ 
turbations due to the irregular variations, mainly 
slowing down, of the rotation. 

In (2.47) the constant 8,640,184.812866 s 
tells how much sidereal time runs fast compared 


to the UT in a Julian century. As the rotation of 
the Earth is slowing down the solar day becomes 
longer. Since the Julian century T contains a fixed 
number of days, it will also become longer. This 
gives rise to the small correction terms in (2.47). 

Strictly speaking this universal time is the time 
denoted by UT1. Observations give UT0, which 
contains a small perturbation due to the wander¬ 
ing of the geographical pole, or polar variation. 
The direction of the axis with respect to the solid 
surface varies by about 0.1" (a few metres on the 
surface) with a period of about 430 days ( Chan¬ 
dler period). In addition to this, the polar motion 
contains a slow nonperiodic part. 

The z axis of the astronomical coordinates is 
aligned with the angular momentum vector of the 
Earth, but the terrestrial coordinates refer to the 
axis at the epoch 1903.5. In the most accurate cal¬ 
culations this has to be taken into account. 

Due to the polar variation the UT0 does not 
grow at a constant rate, since also the direction 
of the origin, the Greenwich meridian, is vary¬ 
ing. There are also additional irregular variations 
of the order of a millisecond, and the secular 
slowing-down caused by tidal forces. 

Nowadays the SI unit of time, the second, is 
defined in a way that has nothing to do with celes¬ 
tial phenomena. Periods of quantum mechanical 
phenomena remain more stable than the motions 
of celestial bodies involving complicated pertur¬ 
bations. 

In 1967, one second was defined as 
9,192,631,770 times the period of the light emit¬ 
ted by cesium 133 isotope in its ground state, 
transiting from hyperfine level F — 4 to F — 3. 
Later, this definition was revised to include 
small relativistic effects produced by gravita¬ 
tional fields. The relative accuracy of this atomic 
time is about 10 — . The international atomic 
time, TAI, was adopted as the basis of time sig¬ 
nals in 1972. The time is maintained by the Bu¬ 
reau International des Poids et Mesures in Paris, 
and it is the average of several accurate atomic 
clocks. 

Even before atomic clocks there was a need 
for an ideal time proceeding at a perfectly con¬ 
stant rate, corresponding to the time variable 
in the equations of Newtonian mechanics. The 
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ephemeris time was such a time. It was used 
e.g. for tabulating ephemerides. The unit of 
ephemeris time was the ephemeris second , which 
is the length of the tropical year 1900 divided by 
31,556,925.9747. Ephemeris time was not known 
in advance. Only afterwards was it possible to de¬ 
termine the difference of ET and UT from obser¬ 
vational data. 

In 1984 ephemeris time was replaced by dy¬ 
namical time. It comes in two varieties. 

The terrestrial dynamical time (TDT) corre¬ 
sponds to the proper time of an observer moving 
with the Earth. The time scale is affected by the 
relativistic time dilation due to the orbital speed 
of the Earth. The rotation velocity depends on 
the latitude, and thus in TDT it is assumed that 
the observer is not rotating with the Earth. The 
zero point of TDT was chosen so that the old ET 
changed without a jump to TDT. 

In 1991 a new standard time, the terrestrial 
time (TT), was adopted. Practically it is equiva¬ 
lent to TDT. 

TT (or TDT) is the time currently used for tab¬ 
ulating ephemerides of planets and other celestial 
bodies. For example, the Astronomical Almanac 
gives the coordinates of the planets for each day 
at 0 TT. 

The Astronomical Almanac also gives the dif¬ 
ference 

AT — TDT — UT (2.49) 

for earlier years. For the present year and some 
future years a prediction extrapolated from the 
earlier years is given. Its accuracy is about 0.1 s. 
At the beginning of 1990 the difference was 
56.7 s; it increases every year by an amount that 
is usually a little less than one second. 

The terrestrial time differs from the atomic 
time by a constant offset 

TT = TAI + 32.184 s. (2.50) 

TT is well suited for ephemerides of phenom¬ 
ena as seen from the Earth. The equations of mo¬ 
tion of the solar system, however, are solved in 
a frame the origin of which is the centre of mass 
or barycentre of the solar system. The coordinate 
time of this frame is called the barycentric dy¬ 
namical time, TDB. The unit of TDB is defined 


so that, on the average, it runs at the same rate as 
TT, the difference containing only periodic terms 
depending on the orbital motion of the Earth. The 
difference can usually be neglected, since it is at 
most about 0.002 seconds. 

Which of these many times should we use 
in our alarm-clocks? None of them. Yet another 
time is needed for that purpose. This official wall- 
clock time is called the coordinated universal 
time, UTC. The zonal time follows UTC but dif¬ 
fers from it usually by an integral number of 
hours. 

UTC is defined so that it proceeds at the same 
rate as TAI, but differs from it by an integral num¬ 
ber of seconds. These leap seconds are used to ad¬ 
just UTC so that the difference from UT1 never 
exceeds 0.9 seconds (Fig. 2.34). A leap second 
is added either at the beginning of a year or the 
night between June and July. 

The difference 

AAT = TAI — UTC (2.51) 

is also tabulated in the Astronomical Almanac. 
According to the definition of UTC the difference 
in seconds is always an integer. The difference 
cannot be predicted very far to the future. 

From (2.50) and (2.51) we get 

TT = UTC + 32.184 s+AAT, (2.52) 

which gives the terrestrial time TT corresponding 
to a given UTC. Table 2.2 gives this correction. 
The table is easy to extend to the future. When 
it is told in the news that a leap second will be 
added the difference will increase by one second. 
In case the number of leap seconds is not known, 
it can be approximated that a leap second will be 
added every 1.25 years. 

The units of the coordinated universal time 
UTC, atomic time TAI and terrestrial time TT are 
the same second of the SI system. Hence all these 
times proceed at the same rate, the only differ¬ 
ence being in their zero points. The difference of 
the TAI and TT is always the same, but due to the 
leap seconds the UTC will fall behind in a slightly 
irregular way. 

Culminations and rising and setting times of 
celestial bodies are related to the rotation of the 
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Fig. 2.34 The difference between the universal time 
UT1, based on the rotation of the Earth, and the coordi¬ 
nated universal time UTC during 1972-2002. Because the 
rotation of the Earth is slowing down, the UT1 will run 


Table 2.2 Differences of the atomic time and UTC 
(AAT) and the terrestrial time TT and UTC. The terres¬ 
trial time TT used in ephemerides is obtained by adding 
A AT + 32.184 s to the ordinary time UTC 



AAT 

TT - UTC 

1.1.1972-30.6.1972 

10 s 

42.184 s 

1.7.1972-31.12.1972 

11s 

43.184 s 

1.1.1973-31.12.1973 

12 s 

44.184 s 

1.1.1974-31.12.1974 

13 s 

45.184 s 

1.1.1975-31.12.1975 

14s 

46.184 s 

1.1.1976-31.12.1976 

15 s 

47.184 s 

1.1.1977-31.12.1977 

16 s 

48.184 s 

1.1.1978-31.12.1978 

17 s 

49.184 s 

1.1.1979-31.12.1979 

18 s 

50.184 s 

1.1.1980-30.6.1981 

19 s 

51.184 s 

1.7.1981-30.6.1982 

20 s 

52.184 s 

1.7.1982-30.6.1983 

21 s 

53.184 s 

1.7.1983- 30.6.1985 

22 s 

54.184 s 

1.7.1985-31.12.1987 

23 s 

55.184 s 

1.1.1988-31.12.1989 

24 s 

56.184 s 

1.1.1990-31.12.1990 

25 s 

57.184 s 

1.1.1991-30.6.1992 

26 s 

58.184 s 

1.7.1992-30.6.1993 

27 s 

59.184 s 

1.7.1993- 30.6.1994 

28 s 

60.184 s 

1.7.1994-31.12.1995 

29 s 

61.184 s 

1.1.1996-31.6.1997 

30 s 

62.184 s 

1.7.1997-31.12.1998 

31 s 

63.184 s 

1.1.1999-31.12.2005 

32 s 

64.184 s 

1.1.2006- 

33 s 

65.184 s 


slow of the UTC by about 0.8 seconds a year. Leap sec¬ 
onds are added to the UTC when necessary to keep the 
times approximately equal. In the graph these leap sec¬ 
onds are seen as one second jumps upward 

Earth. Thus the sidereal time and hence the UT 
of such an event can be calculated precisely. The 
corresponding UTC cannot differ from the UT by 
more than 0.9 seconds, but the exact value is not 
known in advance. The future coordinates of the 
Sun, Moon and planets can be calculated as func¬ 
tions of the TT, but the corresponding UTC can 
only be estimated. 


2.15 Calendars 

Our calendar is a result of long evolution. The 
main problem it must contend with is the incom¬ 
mensurability of the basic units, day, month and 
year: the numbers of days and months in a year 
are not integers. This makes it rather complicated 
to develop a calendar that takes correctly into ac¬ 
count the alternation of seasons, day and night, 
and perhaps also the lunar phases. 

Our calendar has its origin in the Roman cal¬ 
endar, which, in its earliest form, was based on 
the phases of the Moon. From around 700 B.C. 
on, the length of the year has followed the appar¬ 
ent motion of the Sun; thus originated the division 
of the year into twelve months. One month, how¬ 
ever, still had a length roughly equal to the lunar 
cycle. Hence one year was only 354 days long. 
To keep the year synchronised with the seasons, 
a leap month had to be added to every other year. 

Eventually the Roman calendar got mixed up. 
The mess was cleared by Julius Caesar in about 
46 B.C., when the Julian calendar was developed 
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upon his orders by the Alexandrian astronomer 
Sosigenes. The year had 365 days and a leap day 
was added to every fourth year. 

In the Julian calendar, the average length of 
one year is 365 d 6 h, but the tropical year is 
11 min 14 s shorter. After 128 years, the Julian 
year begins almost one day too late. The differ¬ 
ence was already 10 days in 1582, when a calen¬ 
dar reform was carried out by Pope Gregory XIII. 
In the Gregorian calendar, every fourth year is 
a leap year, the years divisible by 100 being ex¬ 
ceptions. Of these, only the years divisible by 400 
are leap years. Thus 1900 was not a leap year, but 
2000 was. The Gregorian calendar was adopted 
slowly, at different times in different countries. 
Although the calendar reform was seen neces¬ 
sary, orders given by the catholic pope were not 
easy to accept in e.g. protestant countries. The 
transition period did not end before the 20th cen¬ 
tury. 

Even the Gregorian calendar is not perfect. 
The differences from the tropical year will accu¬ 
mulate to one day in about 3300 years. 

Since years and months of variable length 
make it difficult to compute time differences, 
especially astronomers have employed various 
methods to give each day a running number. The 
most widely used numbers are the Julian dates. 
In spite of their name, they are not related to the 
Julian calendar. The only connection is the length 
of a Julian century of 36,525 days, a quantity ap¬ 
pearing in many formulas involving Julian dates. 
The Julian day number 0 dawned in 4713 B.C. 
The day number changes always at 12 : 00 UT. 
For example, the Julian day 2,451,545 began at 
noon in January 1, 2000. The Julian date can be 
computed using the formulas given in Box 2.2. 

Julian dates are uncomfortably big numbers, 
and therefore modified Julian dates are often 
used. The zero point can be e.g. January 1, 2000. 
Sometimes 0.5 is subtracted from the date to 
make it to coincide with the date corresponding to 
the UTC. When using such dates, the zero point 
should always be mentioned. 


Box 2.1 (Reduction of Coordinates) Star cat¬ 
alogues give coordinates for some standard 


epoch. In the following we give the formulas 
needed to reduce the coordinates to a given date 
and time. The full reduction is rather labori¬ 
ous, but the following simplified version is suf¬ 
ficient for most practical purposes. 

We assume that the coordinates are given for 
the epoch J2000.0. 

1. First correct the place for proper motion un¬ 
less it is negligible. 

2. Precess the coordinates to the time of the 
observation. First we use the coordinates of 
the standard epoch (ao, ^o) to find a unit 
vector pointing in the direction of the star: 

( cosi5ocosao 
cos <5o sin ao 
sin So 

Precession changes the ecliptic longitude of 
the object. The effect on right ascension and 
declination can be calculated as three rota¬ 
tions, given by three rotation matrices. By 
multiplying these matrices we get the com¬ 
bined precession matrix that maps the pre¬ 
vious unit vector to its precessed equiva¬ 
lent. A similar matrix can be derived for the 
nutation. The transformations and constants 
given here are based on the system standard¬ 
ised by the IAU in 1976. 

The precession and nutation matrices 
contain several quantities depending on 
time. The time variables appearing in their 
expressions are 

t = J — 2,451,545.0, 

J — 2,451,545.0 

T = -----. 

36,525 

Here J is the Julian date of the observa¬ 
tion, t the number of days since the epoch 
J2000.0 (i.e. noon of January 1, 2000), and 
T the same interval of time in Julian cen¬ 
turies. 

The following three angles are needed 
for the precession matrix 

f =2306.2181"r + 0.30188"r 2 

+ 0.017998"T 3 , 
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z = 2306.2181"P + 1.09468 "T 2 
+ 0.018203"r 3 , 

9 = 2004.3109" P - 0.42665"r 2 

-0.041833"P 3 . 

The precession matrix is now 

/ Pn Pi 2 Pis\ 

P — I ^21 P22 P 23 I ■ 

V-P31 P32 P 33 J 

The elements of this matrix in terms of the 
abovementioned angles are 

P] 1 = cos z cos 6 cost; — sin z sin £, 

P] 2 = — cos z cos 9 sin£ — sin z cos £, 

P 13 — — cos z sin 9, 

P 21 — sin z cos 6 cos (, + cos z sin £, 

P 22 — — sinzcos@ sin£ + coszcos£, 

P 23 = — sinzsin0, 

P 31 = sin 6 cos 
P 32 = — sind sinti, 

P 33 = COS0. 

The new coordinates are now obtained by 
multiplying the coordinates of the standard 
epoch by the precession matrix: 

Pi = PPo- 

This is the mean place at the given time and 
date. 

If the standard epoch is not J2000.0, it is 
probably easiest to first transform the given 
coordinates to the epoch J2000.0. This can 
be done by computing the precession matrix 
for the given epoch and multiplying the co¬ 
ordinates by the inverse of this matrix. In¬ 
verting the precession matrix is easy: we 
just transpose it, i.e. interchange its rows 
and columns. Thus coordinates given for 
some epoch can be precessed to J2000.0 by 


multiplying them by 

/ Pn P21 P3A 

P 1 = I P12 P22 P32 I • 

\Pl 3 P 23 P33) 

In case the required accuracy is higher than 
about one minute of arc, we have to do the 
following further corrections. 

3. The full nutation correction is rather com¬ 
plicated. The nutation used in astronomical 
almanacs involves series expansions con¬ 
taining over a hundred terms. Very often, 
though, the following simple form is suffi¬ 
cient. We begin by finding the mean obliq¬ 
uity of the ecliptic at the observation time: 

e 0 = 23° 26' 21.448" - 46.8150" T 
- 0.00059"P 2 + 0.001813"P 3 . 

The mean obliquity means that periodic per¬ 
turbations have been omitted. The formula 
is valid a few centuries before and after the 
year 2000 . 

The true obliquity of the ecliptic, s, is ob¬ 
tained by adding the nutation correction to 
the mean obliquity: 

£ = £0 + A £. 

The effect of the nutation on the ecliptic 
longitude (denoted usually by Ai fr) and the 
obliquity of the ecliptic can be found from 

Ci = 125° - 0.05295°f, 

C 2 = 200.9° + 1.97129°?, 

A f = -0.0048° sin Ci - 0.0004° sinC 2 , 
As = 0.0026° cos Ci + 0.0002° cos C 2 . 

Since Ai// and As are very small angles, 
we have, for example, sinAi/f Ai jr and 
cos Ai/r 1 , when the angles are expressed 
in radians. Thus we get the nutation matrix 

( 1 —At/rcose —Ai/rsineX 

Ai jr cos g 1 —As I . 

Ai/rsine As 1 / 
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This is a linearised version of the full trans¬ 
formation. The angles here must be in radi¬ 
ans. The place in the coordinate frame of the 
observing time is now 

Pl = NPi- 

4. The annual aberration can affect the place 
about as much as the nutation. Approximate 
corrections are obtained from 

Act cos S — — 20.5" sin a sin X 

— 18.8" cosa cos A, 

A5 = 20.5"cosa sin<5 sin! 

+ 18.8" sina sin<5 cos X 

— 8.1"cos<5 cosA, 

where X is the ecliptic longitude of the Sun. 
Sufficiently accurate value for this purpose 
is given by 

G = 357.528° + 0.985600°r, 

1 = 280.460° + 0.985647° r 

+ 1.915° sinG + 0.020° sin2G. 

These reductions give the apparent place of 
the date with an accuracy of a few seconds 
of arc. The effects of parallax and diurnal 
aberration are even smaller. 

Example The coordinates of Regulus 
(a Leo) for the epoch J2000.0 are 

a = 10 h 8 min 22.2 s = 10.139500 h, 

5= 11° 58'02"= 11.967222°. 

Find the apparent place of Regulus on March 
12, 1995. 

We start by finding the unit vector corre¬ 
sponding to the catalogued place: 

/_0.86449829\ 
p 0 = 0.45787318 . 

\ 0.20735204 ) 

The Julian date is / = 2,449,789.0, and 
thus t = -1756 and T = -0.04807666. The 


angles of the precession matrix are f = 
-0.03079849°, z = -0.03079798° and 9 = 
—0.02676709°. The precession matrix is then 

P = 

/ 0.99999931 0.00107506 0.00046717 \ 

-0.00107506 0.99999942 -0.00000025 . 

\—0.00046717 -0.00000025 0.99999989 ) 

The precessed unit vector is 

/—0.86390858\ 
p, = 0.45880225 . 

V 0.20775577 / 

The angles needed for the nutation are 
Af = 0.00309516°, Ae = -0.00186227°, 
e = 23.43805403°, which give the nutation 
matrix 

N — 

1 -0.00004956 —0.00002149\ 

0.00004956 1 0.00003250 . 

0.00002149 -0.00003250 1 ) 

The place in the frame of the date is 

/—0.86393578\ 
p 2 = 0.45876618 , 

\ 0.20772230 / 

whence 

a = 10.135390 h, 

5= 11.988906°. 

To correct for the aberration we first find the 
longitude of the Sun: G = -1373.2° = 66.8°, 
X — — 8.6°. The correction terms are then 

Acr = 18.25" = 0.0050°, 

A<5 = —5.46" = —0.0015°. 

Adding these to the previously obtained coor¬ 
dinates we get the apparent place of Regulus 
on March 12, 1995: 

a — 10.1357 h = 10 h 8 min 8.5 s, 

5= 11.9874°= 11° 59'15". 
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Comparison with the places given in the cat¬ 
alogue Apparent Places of Fundamental Stars 
shows that we are within about 3" of the correct 
place, which is a satisfactory result. 


Box 2.2 (Julian Date) There are several meth¬ 
ods for finding the Julian date. The following 
one, developed by Fliegel and Van Flandern in 
1968, is well adapted for computer programs. 
Let y be the year (with all four digits), m the 
month and d the day. The Julian date J at noon 
is then 

J = 367 y - {7[v + (m + 9)/12]}/4 
-(3{[y + (m-9)/7]/100+l})/4 
+ 275m/9 + d + 1,721,029. 

The division here means an integer division, the 
decimal part being truncated: e.g. 7/3 = 2 and 
-7/3 =-2. 

Example Find the Julian date on January 1, 
1990. 

Now y = 1990, m = 1 and d — 1. 

/ = 367 x 1990 - 7 x [1990 + (1 + 9)/12]/4 
-3 x {[1990+(1 —9)/7]/100+ l}/4 
+ 275 x 1/9 + 1 + 1,721,029 
= 730,330 - 3482 - 15 + 30+1 
+ 1,721,029 
= 2,447,893. 

Astronomical tables usually give the Julian 
date at 0 UT. In this case that would be 
2,447,892.5. 

The inverse procedure is a little more com¬ 
plicated. In the following J is the Julian date at 
noon (so that it will be an integer): 

a — J + 68,569, 

fc= (4fl)/146,097, 

c = a- (146,0976 + 3)/4, 

</ = [4000(c+ 1)]/1,461,001, 


e = c- (1461d)/4+31, 

/ = (80e)/2447, 
day — e — (2447/)/80, 

8=f/11 , 

month = / + 2 — I2g, 
year = 100(fo — 49) + d + g. 

Example In the previous example we got 
J — 2,447,893. Let’s check this by calculating 
the corresponding calendar date: 

a = 2,447,893 + 68,569 = 2,516,462, 

b = (4 x 2,516,462)/146,097 = 68, 

c = 2,516,462 - (146,097 x 68 + 3)/4 

= 32,813, 

d = [4000(32,813 + 1)]/1,461,001 = 89, 

e = 32,813- (1461 x 89)/4 + 31 = 337, 

/ = (80 x 337)/2447 = 11, 

day = 337 - (2447 x 11)/80= 1, 

*=11/11 = 1, 

month =11 + 2— 12 x 1 = 1, 

year = 100(68 - 49) + 89 + 1 = 1990. 

Thus we arrived back to the original date. 

Since the days of the week repeat in seven 
day cycles, the remainder of the division 
J/I unambiguously determines the day of the 
week. If J is the Julian date at noon, the re¬ 
mainder of Jp tells the day of the week in the 
following way: 

0 = Monday, 

5 = Saturday, 

6 = Sunday. 

Example The Julian date corresponding 
to January 1, 1990 was 2,447,893. Since 
2,447,893 = 7 x 349,699, the remainder is 
zero, and the day was Monday. 
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2.16 Examples 

Example 2.1 (Trigonometric Functions in a Rect¬ 
angular Spherical Triangle) Let the angle A be 
a right angle. When the figure is a plane triangle, 
the trigonometric functions of the angle B would 
be: 

sin B=b/a, cos B —c/a, tan B = b/c. 

For the spherical triangle we have to use the 
equations in (2.7), which are now simply: 

sin B sina = sinb, 
cos B sin a = cos b sin c, 
cos a = cosh cos c. 



The first equation gives the sine of B : 
sin B = sinh/sina. 

Dividing the second equation by the third one, we 
get the cosine of B : 

cos B — tan c/tan a. 

And the tangent is obtained by dividing the first 
equation by the second one: 

tan B = tanb/sinc. 

The third equation is the equivalent of the Pytha¬ 
gorean theorem for rectangular triangles. 

Example 2.2 (Distance Between Two Locations) 
The latitude of Helsinki is about 0i = 60° and 
longitude A] = 25°. The corresponding coordi¬ 
nates of La Palma are <p 2 = 28.7° and a 2 = 
— 17.9°. What is the distance between these lo¬ 
cations measured along the surface of the Earth. 

The distance can be found by applying the co¬ 
sine formula to the spherical triangle NHP : 


cosfl = cos(k 2 ~ ki) sin(90° — <p\) 
x sin(90° — 02 ) 

+ cos(90° — 0i) cos(90° — 02 ) 

= COS(A-2 — k 1) COS 01 COS 02 
+ sin 0 i sin 02 ) 

= 0.7325 x 0.5 x 0.8771 
+ 0.8660 x 0.4802 
= 0.7372, 

from which a — 42.5°. 

Another method is to use vector calculus (Ap¬ 
pendix A). The radius vector with respect to the 
centre of the Earth is 

r = R(cos0cosk, cos0sink, sin0), 

where R is the radius of the Earth. In the follow¬ 
ing, only unit vectors are needed, and thus we can 
omit the constant R. In the example th unit vec¬ 
tors are 

r\ = (0.4532, 0.2113, 0.8660), 
r 2 = (0.8347, -0.2696, 0.4802). 

The scalar product of these vectors gives the co¬ 
sine of the angle between the vectors: 

cos a = r\ ■ r 2 

= 0.4532 x 0.8347 - 0.2113 x 0.2696 
+ 0.8660 x 0.4802 
= 0.7372, 

from which a — 42.5° = 0.7419 rad. 

Both methods give the same angular sepa¬ 
ration between the two locations. The distance 
along the great circle on the surface of the Earth 
is then Ra — 6400 x 0.7419 = 4748 km. 

Example 2.3 (The Coordinates of New York 
City) The geographic coordinates are 41° north 
and 74° west of Greenwich, or 0 = +41°, k = 
—74°. In time units, the longitude would be 
74/15 h = 4 h 56 min west of Greenwich. The 
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geocentric latitude is obtained from 


tan </>' 



/6,356,752 
\6,378,137 


2 

tan 41° 


= 0.86347 =4> (/)' = 40° 48' 34". 


The geocentric latitude is 1T 26" less than the ge¬ 
ographic latitude. 


Example 2.4 The angular separation of two ob¬ 
jects in the sky is quite different from their coor¬ 
dinate difference. 

Suppose the coordinates of a star A are a\ = 

10 h, <5 1 =70° and those of another star B, a 2 = 

11 h, 82 = 80°. 

Using the Pythagorean theorem for plane tri¬ 
angles, we would get 

<7 = V (15°) 2 + (10 °) 2 = 18°. 


Example 2.5 Find the altitude and azimuth of 
the Moon in Helsinki at midnight at the begin¬ 
ning of 1996. 

The right ascension is a = 2 h 55 min 7 s = 
2.9186 h and declination 8 = 14°42' = 14.70°, 
the sidereal time is © = 6 h 19 min 26 s = 
6.3239 h and latitude </> = 60.16°. 

The hour angle is h — © — a = 3.4053 h = 
51.08°. Next we apply the equations in (2.16): 

sin A cos a = sin 51.08° cos 14.70° = 0.7526, 
cos A cos a = cos 51.08° cos 14.70° sin 60.16° 
-sin 14.70° cos 60.16° 

= 0.4008, 

sin a = cos 51.08° cos 14.70° cos 60.16° 

+ sin 14.70° sin 60.16° 


But if we use the third equation in (2.7), we get 


= 0.5225. 


cos d = cos(ai — 0 * 2 ) 

x sin(90° -5 1 )sin(90° - S 2 ) 


Thus the altitude is a — 31.5°. To find the az¬ 
imuth we have to compute its sine and cosine: 


+ cos(90° — <5i)cos(90° — <5 2 ) 
= cos(o:i — a 2 ) cos Si cos S 2 
+ sin Si sinS 2 
= cos 15° cos 70° cos 80° 

+ sin 70° sin 80° 


sinA = 0.8827, cosA = 0.4701. 

Hence the azimuth is A = 62.0°. The Moon is 
in the southwest, 31.5 degrees above the horizon. 
Actually, this would be the direction if the Moon 
were infinitely distant. 


= 0.983, 



which yields d — 10.6°. The figure shows why 
the result obtained from the Pythagorean theorem 
is so far from being correct: hour circles (circles 
with a = constant) approach each other towards 
the poles and their angular separation becomes 
smaller, though the coordinate difference remains 
the same. 


Example 2.6 Find the topocentric place of the 
Moon in the case of the previous example. 

The geocentric distance of the Moon at that 
time is R = 62.58 equatorial radii of the Earth. 
For simplicity, we can assume that the Earth is 
spherical. 

We set up a rectangular coordinate frame in 
such a way that the z axis points towards the 
celestial pole and the observing site is in the 
xz plane. When the radius of the Earth is used 
as the unit of distance, the radius vector of the 
observing site is 


COS</)\ 

/0.4976 

0 

= 0 

sin (p J 

\0.8674 
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The radius vector of the Moon is 


real time at the moment Arcturus rises or sets in 
Boston (0 = 42° 19'). 

Neglecting refraction, we get 

cos h — — tan 19° lTtan42° 19 / 

= -0.348 x 0.910=-0.317. 


Hence, h = ±108.47° = 7 h 14 min. The more 
accurate result is 


/ cos 8 cos h \ / 0.6077 \ 

r — R | — cos 5 sin h I = 62.58 | —0.7526 J 
\ sin 5 J V 0 2538 / 


cos /7 = — tan 19° 1 Y tan42° 19' 
sin 35' 

cos 19° IT cos 42° 19' 


The topocentric place of the Moon is 


= -0.331, 


r —r 


r o = 



We divide this vector by its length 62.07 to get 
the unit vector e pointing to the direction of the 
Moon. This can be expressed in terms of the 
topocentric coordinates S' and Jr : 


whence h = ±109.35° = 7 h 17 min. The plus 
and minus signs correspond to setting ant ris¬ 
ing, respectively. When Arcturus rises, the side¬ 
real time is 

0 — a ± h — 14 h 16 min — 7 h 17 min 
= 6 h 59 min 


/ 0.6047 \ / cos S' cos W \ 

e= | —0.7588 I = | — cosS'sinh' I , 

V 0.2420 j V sin 3 ' / 

which gives S' = 14.00° and h' = 51.45°. Next 
we can calculate the altitude and azimuth as in 
the previous example, and we get a — 30.7°, A — 
61.9°. 

Another way to find the altitude is to take the 
scalar product of the vectors e and r o, which 
gives the cosine of the zenith distance: 

cosz — e ■ ro — 0.6047 x 0.4976 
± 0.2420 x 0.8674 
= 0.5108, 

whence z = 59.3° and a = 90° — z — 30.7°. We 
see that this is 0.8° less than the geocentric alti¬ 
tude; i.e. the difference is more than the apparent 
diameter of the Moon. 

Example 2.7 The coordinates of Arcturus are 
a = 14 h 15.7 min, 5 = 19° IT. Find the side- 


and when it sets, the sidereal time is 

0 = 14 h 16 min ± 7 h 17 min 
= 21 h 33 min. 

Note that the result is independent of the date: 
a star rises and sets at the same sidereal time ev¬ 
ery day. 

Example 2.8 The proper motion of Aldebaran is 
/x = 0.20"/a and parallax tz — 0.048". The spec¬ 
tral line of iron at X — 440.5 nm is displaced 
0.079 nm towards the red. What are the radial and 
tangential velocities and the total velocity? 

The radial velocity is found from 

AA Vr 

X c 

0.079 o 

=* ”'=4403‘ 3Xl ° m ' S 

= 5.4 x 10 4 m/s = 54 km/s. 
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The tangential velocity is now given by (2.40), 
since // and n are in correct units: 

4.74 x 0.20 

v t — 4.74 \ir = 4.74u/tt =- 

' ^' 0.048 

= 20 km/s. 

The total velocity is 

v = yjv} + u t 2 = %/54 2 + 20 2 km/s = 58 km/s. 

Example 2.9 Find the local time in Paris (longi¬ 
tude X — 2°) at 12:00. 

Local time coincides with the zonal time along 
the meridian 15° east of Greenwich. Longitude 
difference 15° - 2 ° = 13° equals (13°/15°) x 
60 min = 52 minutes. The local time is 52 min¬ 
utes less than the official time, or 11:08. This is 
mean solar time. To find the true solar time, we 
must add the equation of time. In early Febru¬ 
ary, E.T. = —14 min and the true solar time is 
11:08 — 14 min = 10:54. At the beginning of 
November, ET = +16 min and the solar time 
would be 11:24. Since —14 min and +16 min are 
the extreme values of E.T., the true solar time is 
in the range 10:54-11:24, the exact time depend¬ 
ing on the day of the year. During daylight saving 
time, we must still subtract one hour from these 
times. 

Example 2.10 (Estimating Sidereal Time) Since 
the sidereal time is the hour angle of the vernal 
equinox T, it is 0 h when T culminates or transits 
the south meridian. At the moment of the vernal 
equinox, the Sun is in the direction of T and thus 
culminates at the same time as T. So the sidereal 
time at 12:00 local solar time is 0 : 00 , and at the 
time of the vernal equinox, we have 

© = T + 12 h, 

where T is the local solar time. This is accu¬ 
rate within a couple of minutes. Since the sidereal 
time runs about 4 minutes fast a day, the sidereal 
time, n days after the vernal equinox, is 

+ 12h + nx4 min. 

At autumnal equinox T culminates at 0:00 local 
time, and sidereal and solar times are equal. 


Let us try to find the sidereal time in Paris 
on April 15 at 22:00, Central European standard 
time (= 23:00 daylight saving time). The vernal 
equinox occurs on the average on March 21; thus 
the time elapsed since the equinox is 10 + 15 = 
25 days. Neglecting the equation of time, the lo¬ 
cal time T is 52 minutes less than the zonal time. 
Hence 

® = 7’ + 12h + nx4 min 

= 21 h 8 min + 12 h + 25 x 4 min 

= 34 h 48 min = 10 h 48 min. 

The time of the vernal equinox can vary about one 
day in either direction from the average. There¬ 
fore the accuracy of the result is roughly 5 min. 

Example 2.11 Find the rising time of Arcturus 
in Boston on January 10. 

In Example 2.6 we found the sidereal time 
of this event, © = 6 h 59 min. Since we do not 
know the year, we use the rough method of Ex¬ 
ample 2.9. The time between January 1 and ver¬ 
nal equinox (March 21) is about 70 days. Thus 
the sidereal time on January 1 is 

© « T + 12 h — 70 x 4 min = T + 7 h 20 min, 

from which 

T — © — 7 h 20 min = 6 h 59 min — 7 h 20 min 

= 30 h 59 min — 7 h 20 min = 23 h 39 min. 

The longitude of Boston is 71° W, and the Eastern 
standard time is (4°/15°) x 60 min =16 minutes 
less, or 23:23. 

Example 2.12 Find the sidereal time in Helsinki 
on April 15, 1982 at 20:00 UT. 

The Julian date is J = 2,445,074.5 and 

2,445,074.5-2,451,545.0 

T = -------- 

36,525 

= -0.1771526. 

Next, we use (2.47) to find the sidereal time at 
OUT: 

@o = -1,506,521.0 s = -418 h 28 min 41 s 
= 13 h 31 min 19 s. 
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Since the sidereal time runs 3 min 57 s fast a day 
as compared to the solar time, the difference in 
20 hours will be 

20 

— x 3 min 57 s = 3 min 17 s, 

24 

and the sidereal time at 20 UT will be 13 h 31 min 
19 s + 20 h 3 min 17 s = 33 h 34 min 36 s = 9 h 
34 min 36 s. 

At the same time (at 22:00 Finnish time, 
23:00 daylight saving time) in Helsinki the side¬ 
real time is ahead of this by the amount cor¬ 
responding to the longitude of Helsinki, 25°, 
i.e. 1 h 40 min 00 s. Thus the sidereal time is 
11 h 14 min 36 s. 


2.17 Exercises 

Exercise 2.1 Find the distance between Helsinki 
and Seattle along the shortest route. Where is the 
northernmost point of the route, and what is its 
distance from the North Pole? The longitude of 
Helsinki is 25°E and latitude 60°; the longitude 
of Seattle is 122° W and latitude 48°. Assume that 
the radius of the Earth is 6370 km. 

Exercise 2.2 A star crosses the south meridian 
at an altitude of 85°, and the north meridian at 
45°. Find the declination of the star and the lati¬ 
tude of the observer. 

Exercise 2.3 Where are the following state¬ 
ments true? 

(a) Castor (a Gem, declination 8 = 31 ° 53') is 
circumpolar. 

(b) Betelgeuze (a Ori, 8 — 7° 24') culminates at 
zenith. 

(c) a Cen (8 = —60° 50') rises to an altitude of 
30°. 

Exercise 2.4 In his Old Man and the Sea Hem¬ 
ingway wrote: 

It was dark now as it becomes dark quickly after 
the Sun sets in September. He lay against the worn 


wood of the bow and rested all that he could. The 
first stars were out. He did not know the name of 
Rigel but he saw it and knew soon they would all 
be out and he would have all his distant friends. 

How was Hemingway’s astronomy? 

Exercise 2.5 The right ascension of the Sun on 
June 1, 1983, was 4h 35 min and declination 
22° 00'. Find the ecliptic longitude and latitude 
of the Sun and the Earth. 

Exercise 2.6 Show that on the Arctic Circle the 
Sun 

(a) rises at the same sidereal time 0q between 
December 22 and June 22, 

(b) sets at the same sidereal time ©o between 
June 22 and December 22. 

What is @o? 

Exercise 2.7 Derive the equations (2.24), which 
give the galactic coordinates as functions of the 
ecliptic coordinates. 

Exercise 2.8 The coordinates of Sirius for the 
epoch 1900.0 were a — 6 h 40 min 45 s, 8 — 
— 16° 35', and the components of its proper mo¬ 
tion were fi a = —0.037 s/a, pts — —1.12"a -1 . 
Find the coordinates of Sirius for 2000.0. The 
precession must also be taken into account. 

Exercise 2.9 The parallax of Sirius is 0.375" 
and radial velocity —8 km/s. 

(a) What are the tangential and total velocities of 
Sirius? (See also the previous exercise.) 

(b) When will Sirius be closest to the Sun? 

(c) What will its proper motion and parallax be 
then? 

Exercise 2.10 The average period of the vari¬ 
able star Mira Ceti is 331.96 days. According to 
a catalogue the brightness was at maximum in 
September 22, 2000. When the star was brightest 
in 2010? 
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Up to the end of the Middle Ages, the most im¬ 
portant means of observation in astronomy was 
the human eye. It was aided by various mechan¬ 
ical devices to measure the positions of celes¬ 
tial bodies in the sky. The telescope was invented 
in Holland at the beginning of the 17th century, 
and in 1609 Galileo Galilei made his hist astro¬ 
nomical observations with this new instrument. 
Astronomical photography was introduced at the 
end of the 19th century, and during the last few 
decades many kinds of electronic detectors have 
been adopted for the study of electromagnetic 
radiation from space. The electromagnetic spec¬ 
trum from the shortest gamma rays to long radio 
waves can now be used for astronomical observa¬ 
tions. 


3.1 Observing Through the 
Atmosphere 

With satellites and spacecraft, observations can 
be made outside the atmosphere. Yet, the great 
majority of astronomical observations are carried 
out from the surface of the Earth. In the preceding 
chapter, we discussed refraction, which changes 
the apparent altitudes of objects. The atmosphere 
affects observations in many other, more seri¬ 
ous ways as well. Real problems are the weather, 
restlessness of the atmosphere, and, worst of all, 
opacity of the atmosphere in many wavelength re¬ 
gions. 

The air is never quite steady, and there are lay¬ 
ers with different temperatures and densities; this 
causes convection and turbulence. When the light 


from a star passes through the unsteady air, rapid 
changes in refraction in different directions re¬ 
sult. Thus, the amount of light reaching a detec¬ 
tor, e.g. the human eye, constantly varies; the star 
is said to scintillate (Fig. 3.1). Planets shine more 
steadily, since they are not point sources like the 
stars, and variations in the light coming from dif¬ 
ferent parts partly cancel each others. 

When a wavefront passes an aperture its am¬ 
plitude and phase vary with time. The amplitude 
variations cause scintillation and phase changes 
blur the image. A telescope collects light over 
a larger area, which evens out rapid changes 
and diminishes scintillation. Instead, differences 
in the phase do not cancel out the same way; 
they smear the image and point sources are seen 
in telescopes as vibrating speckles. This phe¬ 
nomenon is called seeing. When the exposure 
time is at least a few seconds the light oscillates 
in all directions, and the stellar image becomes a 
round disk (Fig. 3.3). The diameter of the seeing 
disk can vary from less than an arc second to even 
tens of arc seconds. If the size of the disk small, 
the seeing is good. It is only in good conditions 
on high mountains that the seeing is less than one 
arc second. 

Seeing limits how small details can be seen 
with a telescope. Sometimes the atmosphere can 
calm down for a short period of time, and then it 
is possible to glimpse visually details that are not 
seen in pictures exposed for a longer time. 

Some wavelength regions in the electromag¬ 
netic spectrum are strongly absorbed by the atmo¬ 
sphere (Fig. 3.2). The most important transparent 
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interval is the optical window from about 300 to 
800 nm. This interval coincides with the region of 
sensitivity of the human eye (about 400-700 nm). 

At wavelengths under 300 nm absorption by 
atmospheric ozone prevents radiation from reach¬ 
ing the ground. The ozone is concentrated in 
a thin layer at a height of about 20-30 km, and 
this layer protects the Earth from harmful ultra¬ 
violet radiation. At still shorter wavelengths, the 


main absorbers are O 2 , N 2 and free atoms. Nearly 
all of the radiation under 300 nm is absorbed by 
the upper parts of the atmosphere. 

At wavelengths longer than visible light, in 
the near-infrared region, the atmosphere is fairly 
transparent up to 1.3 pm. There are some absorp¬ 
tion belts caused by water and molecular oxy¬ 
gen, but the atmosphere gets more opaque only 
at wavelengths of longer than 1.3 pm. At these 


Fig. 3.1 Scintillation of 
Sirius during four passes 
across the field of view. 
The star was very low on 
the horizon. (Photo by 
Pekka Parviainen) 
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Fig. 3.3 Stellar images vibrate about equally often in all 
directions, on the average. If the picture is exposed for 
more than about one second the image of the star spreads 
into a circular seeing disc. The better the observing condi¬ 
tions are the smaller is the diameter of the disc 

wavelengths, radiation reaches the lower parts of 
the atmosphere only in a few narrow windows. 
All wavelengths between 20 pm and 1 mm are 
totally absorbed. 

At wavelengths longer than 1 mm, there is the 
radio window extending up to about 20 m. At 
still longer wavelengths, the ionosphere in the up¬ 
per parts of the atmosphere reflects all radiation 
(Fig. 3.2). The exact upper limit of the radio win¬ 
dow depends on the strength of the ionosphere, 
which varies during the day. (The structure of the 
atmosphere is described in Chap. 7.) 

At optical wavelengths (300-800 nm), light 
is scattered by the molecules and dust in the at¬ 
mosphere, and the radiation is attenuated. Scat¬ 
tering and absorption together are called extinc¬ 
tion. Extinction must be taken into account when 
one measures the brightness of celestial bodies 
(Chap. 4). 

In the 19th century Lord Rayleigh succeeded 
in explaining why the sky is blue. Scattering 
caused by the molecules in the atmosphere is in¬ 
versely proportional to the fourth power of the 
wavelength. Thus, blue light is scattered more 
than red light. The blue light we see all over the 
sky is scattered sunlight. The same phenomenon 



Fig. 3.4 Night views from the top of Mount Wilson. The 
upper photo was taken in 1908, the lower one in 1988. The 
lights of Los Angeles, Pasadena, Hollywood and more 
than 40 other towns are reflected in the sky, causing con¬ 
siderable disturbance to astronomical observations. (Pho¬ 
tos by Ferdinand Ellerman and International Dark-Sky As¬ 
sociation) 

colours the setting sun red, because owing to the 
long, oblique path through the atmosphere, all the 
blue light has been scattered away. 

In astronomy one often has to observe very 
faint objects. Thus, it is important that the back¬ 
ground sky be as dark as possible, and the atmo¬ 
sphere as transparent as possible. That is why the 
large observatories have been built on mountain 
tops far from the cities. The air above an observa¬ 
tory site must be very dry, the number of cloudy 
nights few, and the seeing good. 

Astronomers have looked all over the Earth for 
optimal conditions and have found some excep¬ 
tional sites. In the 1970’s, several new major ob¬ 
servatories were founded at these sites. Among 
the best sites in the world are: the extinct vol¬ 
cano Mauna Kea on Hawaii, rising more than 
4000 m above the sea; the dry mountains in north¬ 
ern Chile; the Sonoran desert in the U.S., near 
the border of Mexico; and the mountains on La 
Palma, in the Canary Islands. Many older ob¬ 
servatories are severely plagued by the lights of 
nearby cities (Fig. 3.4). 
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In radio astronomy atmospheric conditions are 
not very critical except when observing at the 
shortest wavelengths, damped by atmospheric 
moisture. Constructors of radio telescopes have 
much greater freedom in choosing their sites than 
optical astronomers. Still, radio telescopes are 
also often constructed in uninhabited places to 
isolate them from disturbing radio and television 
broadcasts. 


3.2 Optical Telescopes 

The telescope fulfils three major tasks in astro¬ 
nomical observations: 

1. It collects light from a large area, making it 
possible to study very faint sources. 

2. It improves resolution and increases the appar¬ 
ent angular diameter of the object. 

3. It is used to measure the positions of objects. 

The light-collecting surface in a telescope 
is either a lens or a mirror. Thus, optical tele¬ 
scopes are divided into two types, lens telescopes 
or refractors and mirror telescopes or reflectors 
(Fig.' 3.5). 

Geometrical Optics Refractors have two len¬ 
ses, the objective which collects the incoming 
light and forms an image in the focal plane, and 
the eyepiece which is a small magnifying glass 
for looking at the image (Fig. 3.5). The lenses 
are at the opposite ends of a tube which can be 
directed towards any desired point. The distance 
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Fig. 3.5 A lens telescope or refractor and a mirror tele¬ 
scope or reflector 


between the eyepiece and the focal plane can be 
adjusted to get the image into focus. The image 
formed by the objective lens can also be regis¬ 
tered, e.g. on a photographic film, as in an ordi¬ 
nary camera. 

The diameter of the objective, D, is called the 
aperture of the telescope. The ratio of the aper¬ 
ture D to the focal length /, F — D/f, is called 
the aperture ratio. This quantity is used to charac¬ 
terise the light-gathering power of the telescope. 
If the aperture ratio is large, near unity, one has 
a powerful, “fast” telescope; this means that one 
can take photographs using short exposures, since 
the image is bright. A small aperture ratio (the fo¬ 
cal length much greater than the aperture) means 
a “slow” telescope. 

In astronomy, as in photography, the aperture 
ratio is often denoted by f/n (e.g. //8), where 
n is the focal length divided by the aperture. For 
fast telescopes this ratio can be //1... //3, but 
usually it is smaller, //8... //15. 

The scale of the image formed in the focal 
plane of a refractor can be geometrically deter¬ 
mined from Fig. 3.6. When the object is seen at 
the angle u, it forms an image of height s , 

s = ftanu^fu, (3.1) 

since u is a very small angle. If the telescope has 
a focal length of, for instance, 343 cm, one arc 
minute corresponds to 

s = 343 cm x Y 
= 343 cm x (1/60) x (tt/180) 

= 1 mm. 



Fig. 3.6 The scale and magnification of a refractor. The 
object subtends an angle u. The objective forms an image 
of the object in the focal plane. When the image is viewed 
through the eyepiece, it is seen at an angle i / 
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Fig. 3.7 Diffraction and resolving power. The image of 
a single star (a) consists of concentric diffraction rings, 
which can be displayed as a mountain diagram (b). Wide 
pairs of stars can be easily resolved (c). For resolving close 

The magnification co is (from Fig. 3.6) 

co=u'/u « ///', (3.2) 

where we have used the equation s = fu. Flere, 
/ is the focal length of the objective and /' that 
of the eyepiece. For example, if / = 100 cm and 
we use an eyepiece with f — 2 cm, the magni¬ 
fication is 50-fold. The magnification is not an 
essential feature of a telescope, since it can be 
changed simply by changing the eyepiece. 

A more important characteristic, which de¬ 
pends on the aperture of the telescope, is the re¬ 
solving power, which determines, for example, 
the minimum angular separation of the compo¬ 
nents of a binary star that can be seen as two sep¬ 
arate stars. The theoretical limit for the resolution 
is set by the diffraction of light: The telescope 
does not form a point image of a star, but rather 
a small disk, since light “bends around the cor¬ 
ner” like all radiation (Fig. 3.7). 

The theoretical resolution of a telescope is of¬ 
ten given in the form introduced by Rayleigh (see 
Box 3.1) 

sin0 ^0 = 1.22 k/D, [0] = rad. (3.3) 


binaries, different criteria can be used. One is the Rayleigh 
limit 1.22 A ./D (d). In practice, the resolution can be writ¬ 
ten A/D, which is near the Dawes limit (e). (Photo (a) Sky 
and Telescope) 

As a practical rule, we can say that two objects 
are seen as separate if the angular distance be¬ 
tween them is 

0 > A/D, [0] = rad. (3.4) 

This formula can be applied to optical as well as 
radio telescopes. For example, if one makes ob¬ 
servations at a typical yellow wavelength (A = 
550 nm), the resolving power of a reflector with 
an aperture of 1 m is about 0.1". However, seeing 
spreads out the image to a diameter of typically 
one arc second. Thus, the theoretical diffraction 
limit cannot usually be reached on the surface of 
the Earth. 

In photography the resolution is further re¬ 
stricted by the properties of the photographic 
plate or the pixel size of the CCD camera, de¬ 
creasing the resolution as compared with vi¬ 
sual observations. The grain size of photographic 
emulsions is about 0.01-0.03 mm, which is also 
the minimum size of the image. For a focal length 
of 1 m, the scale is 1 mm = 206", and thus 
0.01 mm corresponds to about 2 arc seconds. This 
is similar to the theoretical resolution of a tele¬ 
scope with an aperture of 7 cm in visual obser- 
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vations. The pixel sizes of CCD cameras have 
decreased, and currently the resolution can be as 
good as or even better than the resolution of pho¬ 
tographic films. 

In practice, the resolution of visual observa¬ 
tions is determined by the ability of the eye to see 
details. In night vision (when the eye is perfectly 
adapted to darkness) the resolving capability of 
the human eye is about 2'. 

The maximum magnification ® max is the 
largest magnification that is worth using in tele¬ 
scopic observations. Its value is obtained from 
the ratio of the resolving capability of the eye, 
e ~ 2' = 5.8 x 10 -4 rad, to the resolving power 
of the telescope, 9, 


tt>max — e/6 


eD/X = 


5.8 x 1CT 4 D 
5.5 x 10 -7 m 


» D/l mm. 


(3.5) 


If we use, for example, an objective with a di¬ 
ameter of 100 mm, the maximum magnification 
is about 100. If larger magnifications are used 
the object is seen bigger but no more details 
are resolved. Especially small and cheap amateur 
telescopes are advertised having huge magnifi¬ 
cations, which are useless; rather they make ob¬ 
serving very difficult. However, experienced ob¬ 
servers may sometimes use very big magnifica¬ 
tions in some special occasions. 

The minimum magnification &> min is the small¬ 
est magnification that is useful in visual observa¬ 
tions. Its value is obtained from the condition that 
the diameter of the exit pupil L of the telescope 
must be smaller than or equal to the pupil of the 
eye. 

The exit pupil is the image of the objective 
lens, formed by the eyepiece, through which the 
light from the objective goes behind the eyepiece. 
From Fig. 3.8 we obtain 

/ 7 D 

L= J -D= (3.6) 
/ ® 

Thus the condition L <d means that 


co > D/d. (3.7) 


/ f 



Fig. 3.8 The exit pupil L is the image of the objective 
lens formed by the eyepiece 



Fig. 3.9 Chromatic aberration. Light rays of different 
colours are refracted to different focal points (left). The 
aberration can be corrected with an achromatic lens con¬ 
sisting of two parts (right) 


In the night, the diameter of the pupil of the hu¬ 
man eye is about 6 mm, and thus the minimum 
magnification of a 100 mm telescope is about 17. 

Refractors In the first refractors, which had 
a simple objective lens, the observations were 
hampered by the chromatic aberration. Since 
glass refracts different colours by different 
amounts, all colours do not meet at the same fo¬ 
cal point (Fig. 3.9), but the focal length increases 
with increasing wavelength. To remove this aber¬ 
ration, achromatic lenses consisting of two parts 
were developed in the 18th century. The colour 
dependence of the focal length is much smaller 
than in single lenses, and at some wavelength, ko, 
the focal length has an extremum (usually a min¬ 
imum). Near this point the change of focal length 
with wavelength is very small (Fig. 3.10). If the 
telescope is intended for visual observations, we 
choose ko = 550 nm, corresponding to the maxi¬ 
mum sensitivity of the eye. Objectives for photo¬ 
graphic refractors were usually constructed with 
ko ~ 425 nm, since normal photographic plates 
were most sensitive to the blue part of the spec¬ 
trum. 


















3.2 Optical Telescopes 


57 



Fig. 3.10 The wavelength dependence of the focal length 
of a typical achromatic objective for visual observations. 
The focal length has a minimum near X = 550 nm, where 
the eye is most sensitive. In bluer light (X = 450 nm) or 
in redder light (A. = 800 nm), the focal length increases by 
a factor of about 1.002 

By combining three or even more lenses of dif¬ 
ferent glasses in the objective, the chromatic aber¬ 
ration can be corrected still better (as in apochro- 
matic objectives). Also, special glasses have been 
developed where the wavelength dependences of 
the refractive index cancel out so well that two 
lenses already give a very good correction of the 
chromatic aberration. They have, however, hardly 
been used in astronomy so far. 

The largest refractors in the world have an 
aperture of about one metre (102 cm in the Yerkes 
Observatory telescope (Fig. 3.11), finished in 
1897, and 91 cm in the Lick Observatory tele¬ 
scope (1888)). The aperture ratio is typically 
// 10 ...// 20 . 

The use of refractors is limited by their small 
field of view and awkwardly long structure. Re¬ 
fractors are still used, e.g. in solar telescopes and 
various meridian telescopes for measuring the po¬ 
sitions of stars, and also for visual observations of 
binary stars. 

A wider field of view is obtained by using 
more complex lens systems, and telescopes of 
this kind are called astrographs. Astrographs 
have an objective made up of typically 3-5 lenses 
and an aperture of less than 60 cm. The aperture 
ratio is //5 ... f/1 and the field of view about 5°. 
Astrographs are used to photograph large areas of 
the sky, e.g. for proper motion studies and for sta¬ 
tistical brightness studies of the stars. 


Reflectors By far the most common telescope 
type in astrophysical research nowadays is the 
mirror telescope or reflector. As a light-collecting 
surface, it employs a mirror coated with a thin 
layer of aluminium. The form of the mirror is usu¬ 
ally parabolic. A parabolic mirror reflects all light 
rays entering the telescope parallel to the main 
axis into the same focal point. The image formed 
at this point can be observed through an eyepiece 
or registered with a detector. One of the advan¬ 
tages of reflectors is the absence of chromatic 
aberration, since all wavelengths are reflected to 
the same point. 

In the very largest telescopes, the instruments 
can be installed at the primary focus (Fig. 3.12) 
without eclipsing too much of the incoming light. 
Sometimes even the observer could sit in a cage 
at the primary focus. In smaller telescopes, this 
is not possible, and the image must be inspected 
from outside the telescope. In modern telescopes 
instruments are remotely controlled, and the ob¬ 
server must stay away from the telescope to re¬ 
duce thermal turbulence. 

The idea of a reflector was described in 1663 
by James Gregory. The first practical reflector, 
however, was built by Isaac Newton. Fie guided 
the light perpendicularly out from the telescope 
with a small flat mirror. Therefore the focus of 
the image in such a system is called the New¬ 
ton focus. A typical aperture ratio of a Newtonian 
telescope is // 3 ... f/10. Another possibility is 
to bore a hole at the centre of the primary mirror 
and reflect the rays through it with a small hy¬ 
perbolic secondary mirror in the front end of the 
telescope. In such a design, the rays meet in the 
Cassegrain focus. Cassegrain systems have aper¬ 
ture ratios of //8... //15. 

The effective focal length (/ e ) of a Cassegrain 
telescope is determined by the position and con¬ 
vexity of the secondary mirror. Using the nota¬ 
tions of Fig. 3.13, we get 

/e = -/p. (3.8) 

a 

If we choose a <C fc, we have / e / p . In this 
way one can construct short telescopes with long 
focal lengths. Cassegrain systems are especially 
well suited for spectrographic, photometric and 
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other instruments, which can be mounted in the 
secondary focus, easily accessible to the observer. 

More complicated arrangements use several 
mirrors to guide the light through the declination 
axis of the telescope to a fixed coude focus (from 


the French word couder, to bend), which can even 
be situated in a separate room near the telescope 
(Fig. 3.14). The focal length is thus very long 
and the aperture ratio //30... //40. The coude 
focus is used mainly for accurate spectroscopy. 


Fig. 3.11 The largest 
refractor in the world is at 
the Yerkes Observatory, 
University of Chicago. It 
has an objective lens with 
a diameter of 102 cm. 
(Photo by Yerkes 
Observatory) 




Fig. 3.12 Different locations of the focus in reflectors: observations near the celestial pole. More complex coude 
primary focus, Newton focus, Cassegrain focus and coude systems usually have three flat mirrors after the primary 
focus. The coude system in this figure cannot be used for and secondary mirrors 
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since the large spectrographs can be stationary 
and their temperature can be held accurately con¬ 
stant. Nowadays this can also be achieved by us¬ 
ing optical fibers. A drawback is that much light 
is lost in the reflections in the several mirrors 
of the coude system. An aluminised mirror re¬ 
flects about 80 % of the light falling on it, and 
thus in a coude system of, e.g. five mirrors (in¬ 
cluding the primary and secondary mirrors), only 



Fig. 3.13 The principle of a Cassegrain reflector. A con¬ 
cave (paraboloid) primary mirror M\ reflects the light rays 
parallel to the main axis towards the primary focus Si- 
A convex secondary mirror M 2 (hyperboloid) reflects the 
rays back through a small hole at the centre of the main 
mirror to the secondary focus S 2 outside the telescope 


0.8 5 ~ 30 % of the light reaches the detector. In 
new big telescopes the large instruments are usu¬ 
ally mounted at the Nasmyth focus at the end of 
the horizontal axis of the telescope. 

The reflector has its own aberration, coma. It 
affects images displaced from the optical axis. 
Light rays do not converge at one point, but form 
a figure like a comet. Due to the coma, the clas¬ 
sical reflector with a paraboloid mirror has a very 
small correct field of view. The coma limits the 
diameter of the useful field to 2-20 minutes of 
arc, depending on the aperture ratio of the tele¬ 
scope. The 5 m Palomar telescope, for instance, 
has a useful field of view of about 4', correspond¬ 
ing to about one-eighth of the diameter of the 
Moon. In practice, the small field of view can be 
enlarged by various correcting lenses. 

If the primary mirror were spherical, there 
would be no coma. However, this kind of mir¬ 
ror has its own error, spherical aberration: light 
rays from the centre and edges converge at dif¬ 
ferent points. To remove the spherical aberration, 
the Estonian astronomer Bernhard Schmidt de¬ 
veloped a thin correcting lens that is placed in 


Insulated 



Fig. 3.14 The coude system of the Kitt Peak 2.1m reflector. (Drawing National Optical Astronomy Observatories, Kitt 
Peak National Observatory) 
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the way of the incoming light. Schmidt cameras 
(Figs. 3.15 and 3.16) have a very wide (about 7°), 
nearly faultless field of view, and the correcting 
lens is so thin that it absorbs very little light. The 
images of the stars are very sharp. In Schmidt 
telescopes the diaphragm with the correcting lens 
is positioned at the centre of the radius of curva¬ 
ture of the mirror (this radius equals twice the fo¬ 
cal length). To collect all the light from the edges 
of the field of view, the diameter of the mirror 
must be larger than that of the correcting glass. 
The Palomar Schmidt camera, for example, has 
an aperture of 122 cm (correcting lens)/183 cm 
(mirror) and a focal length of 300 cm. The largest 
Schmidt telescope in the world is in Tauten- 
burg, Germany, and its corresponding values are 
134/203/400 cm. 

A disadvantage of the Schmidt telescope is 
the curved focal plane, consisting of a part of 
a sphere. When the telescope is used for pho¬ 
tography, the plate must be bent along the curved 
focal plane. Another possibility of correcting the 
curvature of the field of view is to use an extra 
correcting lens near the focal plane. Such a so¬ 
lution was developed by the Finnish astronomer 
Yrjo Vaisala in the 1930’s, independently of 
Schmidt. Such a system is often called a Schmidt- 



Fig. 3.15 The principle of the Schmidt camera. A cor¬ 
recting glass at the centre of curvature of a concave spher¬ 
ical mirror deviates parallel rays of light and compensates 
for the spherical aberration of the spherical mirror. (In the 
figure, the form of the correcting glass and the change of 
direction of the light rays have been greatly exaggerated.) 
Since the correcting glass lies at the centre of curvature, 
the image is practically independent of the incoming an¬ 
gle of the light rays. Thus there is no coma or astigmatism, 
and the images of stars are points on a spherical surface 
at a distance of R/ 2, where R is the radius of curvature 
of the spherical mirror. In photography, the plate must be 
bent into the form of the focal surface, or the field rectified 
with a corrector lens 


Vaisala telescope. The chips of CCD cameras are 
usually much smaller than photographic plates, 
and then the curved focal surface is not a prob¬ 
lem. 

Schmidt cameras have proved to be very ef¬ 
fective in mapping the sky. They have been used 
to photograph the Palomar Sky Atlas mentioned 
in the previous chapter and its continuation, the 
ESO/SRC Southern Sky Atlas. 

The Schmidt camera is an example of a cata- 
dioptric telescope , which has both lenses and 
mirrors. Schmidt-Cassegrain telescopes used by 
many amateurs are modifications of the Schmidt 
camera. They have a secondary mirror mounted 
at the centre of the correcting lens; the mirror re¬ 
flects the image through a hole in the primary mir¬ 
ror. Thus the effective focal length can be rather 
long, although the telescope itself is very short. 
Another common catadioptric telescope is the 
Maksutov telescope. Both surfaces of the correct¬ 
ing lens as well as the primary mirror of a Mak¬ 
sutov telescope are concentric spheres. 

Another way of removing the coma of the 
classical reflectors is to use more complicated 
mirror surfaces. The Ritchey-Chretien system 
has hyperboloidal primary and secondary mir¬ 
rors, providing a fairly wide useful field of view. 
Ritchey-Chretien optics are used in many large 
telescopes. 

Mountings of Telescopes A telescope has to be 
mounted on a steady support to prevent its shak¬ 
ing, and it must be smoothly rotated during obser¬ 
vations. There are two principal types of mount¬ 
ing, equatorial and azimuthal (Fig. 3.17). 

In the equatorial mounting, one of the axes is 
directed towards the celestial pole. It is called the 
polar axis or hour axis. The other one, the decli¬ 
nation axis , is perpendicular to it. Since the hour 
axis is parallel to the axis of the Earth, the appar¬ 
ent rotation of the sky can be compensated for by 
turning the telescope around this axis at a con¬ 
stant rate. 

The declination axis is the main technical 
problem of the equatorial mounting. When the 
telescope is pointing to the south its weight 
causes a force perpendicular to the axis. When 
the telescope is tracking an object and turns west- 
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Fig. 3.16 The large 
Schmidt telescope of the 
European Southern 
Observatory. The diameter 
of the mirror is 1.62 m and 
of the free aperture 1 m. 
(Photo ESO) 



ward, the bearings must take an increasing load 
parallel with the declination axis. 

In the azimuthal mounting, one of the axes is 
vertical, the other one horizontal. This mounting 
is easier to construct than the equatorial mount¬ 
ing and is more stable for very large telescopes. 
In order to follow the rotation of the sky, the tele¬ 
scope must be turned around both of the axes 
with changing velocities. The field of view will 
also rotate; this rotation must be compensated for 
when the telescope is used for photography. 

If an object goes close to the zenith, its az¬ 
imuth will change 180° in a very short time. 
Therefore, around the zenith there is a small re¬ 
gion where observations with an azimuthal tele¬ 
scope are not possible. 


The largest telescopes in the world were equa- 
torially mounted until the development of com¬ 
puters made possible the more complicated guid¬ 
ance needed for azimuthal mountings. Most of 
the recently built large telescopes are already az- 
imuthally mounted. Azimuthally mounted tele¬ 
scopes have two additional obvious places for 
foci, the Nasmyth foci at both ends of the hori¬ 
zontal axis. 

The Dobson mounting, used in many amateur 
telescopes, is azimuthal. The magnification of the 
Newtonian telescope is usually small, and the 
telescope rests on pieces of teflon, which make 
it very easy to move. Thus the object can easily 
be tracked manually. 
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Fig. 3.17 The equatorial mounting {left) and the azimuthal mounting (right) 


Another type of mounting is the coelostat, 
where rotating mirrors guide the light into a sta¬ 
tionary telescope. This system is used especially 
in solar telescopes. 

To measure absolute positions of stars and ac¬ 
curate time, telescopes aligned with the north- 
south direction are used. They can be rotated 
around one axis only, the east-west horizontal 
axis. Meridian circles or transit instruments with 
this kind of mounting were widely constructed 
for different observatories during the 19th cen¬ 
tury. A few are still used for astrometry, but they 
are now highly automatic. The most important of 
the still operational instruments is that of the U.S. 
Naval Observatory in Flagstaff. The meridian cir¬ 
cle on La Palma funded by the Carlsberg founda¬ 
tion was closed in 2013. 

New Techniques Detectors are already ap¬ 
proaching the theoretical limit of efficiency, 
where all incident photons are registered. Ulti¬ 
mately, to detect even fainter objects the only 
solution is to increase the light gathering area 
(Fig. 3.18), but also the mirrors are getting close 
to the practical maximum size. Thus, new techni¬ 
cal solutions are needed. 


One new feature is active optics, used in most 
new big telescopes. The mirror is very thin, but its 
shape is kept exactly correct by a computer con¬ 
trolled support mechanism. The weight and pro¬ 
duction cost of such a mirror are much smaller 
compared with a conventional thick mirror. Be¬ 
cause of the smaller weight also the supporting 
structure can be made lighter. 

Developing the control mechanism further 
leads to adaptive optics. A reference star (or an 
artificial beam) is monitored constantly in order 
to obtain the shape of the seeing disk. The attitude 
or the shape of a small auxiliary mirror is adjusted 
up to hundreds of times a second to keep the im¬ 
age as concentrated as possible. Adaptive optics 
has been taken into use in the largest telescopes 
of the world from about the year 2000 on. 

The mirrors of large telescopes need not be 
monolithic, but can be made of smaller pieces 
that are, e.g. hexagonal. These mosaic mirrors 
are very light and can be used to build up mir¬ 
rors with diameters of several tens of metres 
(Fig. 3.20). Using active optics, the hexagons can 
be accurately focused. The California Associa¬ 
tion for Research in Astronomy has constructed 
the William M. Keck telescope with a 10 m mo- 
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Fig. 3.18 The largest 
telescopes in the world in 
1947-2000. (a) For nearly 
30 years, the 5.1 m Hale 
telescope on Mount 
Palomar, California, USA, 
was the largest telescope in 
the world, (b) The BTA, 
Big Azimuthal Telescope, 
is situated in the Caucasus 
in the southern Soviet 
Union. Its mirror has 
a diameter of 6 m. It was 
set in operation at the end 
of 1975. (c) The William 
M. Keck Telescope on the 
summit of Mauna Kea, 
Hawaii, was completed in 
1992. The 10 m mirror 
consists of 36 hexagonal 
segments. Another similar 
telescope next to it was 
completed in 1996. (Photos 
Palomar Observatory, 
Spetsialnaya 
Astrofizitsheskaya 
Observatorya, and Roger 
Ressmeyer-Starlight for 
the California Association 
for Research in 
Astronomy) 



saic mirror. It is located on Mauna Kea, and the 
last segment was installed in 1992. Also the E- 
ELT telescope (Fig. 3.21) planned by ESO will 
have a mosaic mirror. 

The reflecting surface does not have to be con¬ 
tinuous, but can even consist of several separate 
telescopes. Next to the Keck another similar tele¬ 
scope, the Keck II, was completed in 1996. The 
pair can be used as an interferometer that has the 
same resolving power as a single 75 metre mir¬ 
ror. 

The European Southern Observatory has con¬ 
structed its own multi-mirror telescope. ESO’s 
Very Large Telescope (VLT) has four closely lo¬ 
cated telescopes (Fig. 3.19). The diameter of each 
mirror is eight metres, and the total area corre¬ 


sponds to one telescope with a 16 m mirror. The 
resolution is even better, since the “aperture”, i.e. 
the maximum distance between the mirrors, is 
several tens of meters. 

An important astronomical instruments of the 
20th century is the Hubble Space Telescope, 
launched in 1990 (Fig. 3.22). It has a mirror with 
a diameter of 2.4 m. The resolution of the tele¬ 
scope (after the faulty optics was corrected) is 
near the theoretical diffraction limit, since there 
is no disturbing atmosphere. A second genera¬ 
tion Space Telescope, now called the James Webb 
Space Telescope , will have a mosaic mirror of 
6.5 m. The launch has been postponed mainly for 
financial reasons. The current estimates for the 
launch are around 2018. 
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Fig. 3.18 (Continued) 



Space telescopes avoid the atmospheric per¬ 
turbations. However, comparable results are 
nowadays obtained using the adaptive optics. Due 
to budgetary reasons, the majority of astronomi¬ 
cal observations will still be carried out on the 
Earth, and great attention will be given to improv¬ 
ing ground-based observatories and detectors. 

In the future, satellites will continue to be 
mainly used for those wavelength regions where 
the radiation is absorbed by the atmosphere. 
(Fig. 3.2 and Sect. 3.5). 


3.3 Detectors and Instruments 

Only a limited amount of information can be ob¬ 
tained by looking through a telescope with the 


unaided eye. Yet until the end of the 19th cen¬ 
tury this was the only way to make observations. 
The invention of photography in the middle of 
the 19th century brought a revolution in astron¬ 
omy. The next important step forward in optical 
astronomy was the development of photoelectric 
photometry in the 1940’s and 1950’s. A new revo¬ 
lution, comparable to that caused by the invention 
of photography, took place in the middle of the 
1970’s with the introduction of different semicon¬ 
ductor detectors. The sensitivity of detectors has 
grown so much that today, a 60 cm telescope can 
be used for observations similar to those made 
with the Palomar 5 m telescope when it was set 
in operation in the 1940’s. 
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Fig. 3.18 (Continued) 



The Photographic Plate Photography has long 
been one of the most common methods of obser¬ 
vation in astronomy. In astronomical photogra¬ 
phy glass plates were used, rather than film, since 
they keep their shape better, but nowadays they 
are no more manufactured, and CCD-cameras 
have largely replaced photography. The sensitive 
layer on the surface of the film or plate is made up 
of a silver halide, usually silver bromide, AgBr. 
A photon absorbed by the halide excites an elec¬ 
tron that can move from one atom to another. 
A silver ion, Ag + , can catch the electron, becom¬ 
ing a neutral atom. When the necessary amount 
of silver atoms have been accumulated at one 
place, they form a latent image. The latent image 
can be made into a permanent negative by treat¬ 


ing the plate after exposure with various chemi¬ 
cals, which transform the silver bromide crystals 
enclosing the latent image into silver (“develop¬ 
ment”), and remove the unexposed crystals (“fix¬ 
ing”). 

The photographic plate has many advantages 
over the human eye. The plate can register up to 
millions of stars (picture elements) at one time, 
while the eye can observe at most one or two 
objects at a time. The image on a plate is prac¬ 
tically permanent—the picture can be studied at 
any time. In addition, the photographic plate is 
cheap and easy to use, as compared to many other 
detectors. The most important feature of a plate 
is its capability to collect light over an extended 
time: the longer exposures are used, the more sil- 
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Fig. 3.19 Some new large telescopes, (a) The European 
Southern Observatory (ESO) was founded by Belgium, 
France, the Netherlands, Sweden and West Germany in 
1962. Other European countries have joined them later. 
The VLT (Very Large Telescope) on Cerro Paranal in 
Northern Chile, was inaugurated in 1998-2000. (b) Cur¬ 
rently the largest telescope is the GTC (Gran Telesco- 


pio Canarias or GranTeCan) on La Palma. The observa¬ 
tions with the 10.4 m mirror were made in 2009. (c) The 
largest binocular is the LBT (Large Binocular Telescope) 
on Mount Graham in Arizona, completed in 2008. It has 
two 8.4 metre mirrors. (Photos ESO, IAC, Max Planck 
Institute) 


ver atoms are formed on the plate (the plate dark¬ 
ens). By increasing the exposure times, fainter 
objects can be photographed. The eye has no such 


capacity: if a faint object does not show through 
a telescope, it cannot been seen, no matter how 
long one stares. 
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Fig. 3.19 (Continued) 



Fig. 3.20 The mirror of 
a telescope can be made up 
of several smaller 
segments, which are much 
easier to manufacture, as in 
the Hobby-Eberle 
Telescope on Mount 
Fowlkes, Texas. The 
effective diameter of the 
mirror is 9.1 m. A similar 
telescope is being built in 
South Africa. (Photo 
MacDonald Observatory) 



One disadvantage of the photographic plate is 
its low sensitivity. Only one photon in a thou¬ 
sand causes a reaction leading to the formation 
of a silver grain. Thus the quantum efficiency of 
the plate is only 0.1 %. Several chemical treat¬ 
ments can be used to sensitise the plate before 
exposure. This brings the quantum efficiency up 
to a few percent. Another disadvantage is the fact 
that a silver bromide crystal that has been ex¬ 
posed once does not register anything more, i.e. 


a saturation point is reached. On the other hand, 
a certain number of photons are needed to pro¬ 
duce an image. Doubling the number of pho¬ 
tons does not necessarily double the density (the 
‘blackness’ of the image): the density of the plate 
depends nonlinearly on the amount of incoming 
light (Fig. 3.23). The sensitivity of the plate is 
also strongly dependent on the wavelength of the 
light. For the reasons mentioned above the ac¬ 
curacy with which brightness can be measured 
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Fig. 3.21 In order to observe objects essentially fainter 
than nowadays the size of the objective must be increased 
considerably. The European Southern Observatory ESO 
started to make plans for a 100 metre OWL telescope 
(Overwhelmingly Large telescope), but when the price 
turned out to be “astronomical” the size has been reduced 

Fig. 3.22 The Hubble 
Space Telescope after the 
latest service flight in 2009. 

The telescope got new 
solar panels and several 
other upgrades. (Photo 
NASA) 


several times. When the E-ELT telescope (European Ex¬ 
tremely Large Telescope) will be completed the main mir¬ 
ror will have a diameter of 39 metres. The mirror will con¬ 
sist of 798 hexagonal pieces, each having a diameter of 
1.45 metres. (ESO) 



on a photographic plate is usually worse than 
about 5 %. Thus the photographic plate makes 
a poor photometer, but it can be excellently used, 
e.g. for measuring the positions of stars (posi¬ 
tional astronomy) and for mapping the sky. Old 
photography archives are still valuable for study¬ 
ing proper motions and brightness variations that 
are not too minuscule. 


Photocathodes, Photomultipliers A photo- 
cathode is a more effective detector than the pho¬ 
tographic plate. It is based on the photoelectric 
effect. A light quantum, or photon, hits the pho¬ 
tocathode and loosens an electron. The electron 
moves to the positive electrode, or anode, and 
gives rise to an electric current that can be mea¬ 
sured. The quantum efficiency of a photocathode 
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Fig. 3.23 The density of a photographic emulsion de¬ 
pends on the incident light. If there is very little light it has 
no effect. Even then the film is not completely transparent 
since it has a light background fog. When the amount of 
light increases the density will begin to grow first nonlin- 
early. When the exposure is optimal, which depends of the 
type of the film, the dependence of the density on exposure 
is nearly linear. When there is too much light, all grains of 
the film become dark and the image is saturated 

is about 10-20 times better than that of a pho¬ 
tographic plate; optimally, an efficiency of 30 % 
can be reached. A photocathode is also a linear 
detector: if the number of electrons is doubled, 
the outcoming current is also doubled. 

The photomultiplier is one of the most impor¬ 
tant applications of the photocathode. In this de¬ 
vice, the electrons leaving the photocathode hit 
a dynode. For each electron hitting the dynode, 
several others are released. When there are sev¬ 
eral dynodes in a row, the original weak current 
can be intensified a millionfold. The photomulti¬ 
plier measures all the light entering it, but does 
not form an image. Photomultipliers are mostly 
used in photometry, and an accuracy of 0.1-1 % 
can be attained. 

Photometers, Polarimeters A detector mea¬ 
suring brightness, a photometer , is usually lo¬ 
cated behind the telescope in the Cassegrain fo¬ 
cus. In the focal plane there is a small hole, the di¬ 
aphragm, which lets through light from the object 
under observation. In this way, light from other 
stars in the field of view can be prevented from 
entering the photometer. A field lens behind the 
diaphragm refracts the light rays onto a photo¬ 
cathode. The outcoming current is intensified fur- 


Dia- Colli- Semitransparent 

phragm mator mirrors 



Fig. 3.24 The principle of a photoelectric multicolour 
photometer. Light collected by the telescope arrives from 
the left. The light enters the photometer through a small 
hole in the focal plane, the diaphragm. A lens collimates 
the light into a parallel beam. Semitransparent mirrors di¬ 
vide the beam to several photomultipliers. A field lens 
guides the light through a filter onto the photocathode of 
the photomultiplier. The quanta of light, photons, release 
electrons from the cathodes. The electrons are accelerated 
towards the dynodes with a voltage of about 1500 V. The 
electrons hitting the dynodes release still more electrons, 
and the current is greatly enhanced. Every electron emit¬ 
ted from the cathode gives rise to a pulse of up to 10 8 elec¬ 
trons at the anode; the pulse is amplified and registered by 
a pulse counter. In this way, the photons from the star are 
counted 

ther in a preamplifier. The photomultiplier needs 
a voltage of 1000-1500 volts. 

Observations are often made in a certain wave¬ 
length interval, instead of measuring all the ra¬ 
diation entering the detector. In this case a filter 
is used to prevent other wavelengths from reach¬ 
ing the photomultiplier. A photometer can also 
consist of several photomultipliers (Fig. 3.23), 
which measure simultaneously different wave¬ 
length bands. In such an instrument beam split¬ 
ters or semitransparent mirrors split the light 
beam through fixed filters to the photomultipliers. 

In a device called the photopolarimeter , a po¬ 
larising filter is used, either alone or in combina¬ 
tion with other filters. The degree and direction of 
polarisation can be found by measuring the inten¬ 
sity of the radiation with different orientations of 
the polarisers. 

In practice, the diaphragm of a photometer 
will always also let through part of the back¬ 
ground sky around the observed object. The mea¬ 
sured brightness is in reality the combined bright¬ 
ness of the object and the sky. In order to find the 
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brightness of the object, the background bright¬ 
ness must be measured separately and subtracted 
from the combined brightness. The accuracy of 
the measurements is decreased if long observa¬ 
tion times are used and the background bright¬ 
ness undergoes fast changes. The problem can be 
solved by observing the brightness of the back¬ 
ground sky and the object simultaneously. 

Photometric observations are often relative. If 
one is observing, e.g. a variable star, a reference 
star close to the actual target is observed at regu¬ 
lar intervals. Using the observations of this ref¬ 
erence star it is possible to derive a model for 
the slow changes in the atmospheric extinction 
(see Chap. 4) and remove their effect. The instru¬ 
ment can be calibrated by observing some stan¬ 
dard stars, whose brightness is known very accu¬ 
rately. 

Image Intensifiers Different image intensifiers 
based on the photocathode have been used since 
the 1960’s. In the intensifier the information 
about the starting point of the electron on the pho¬ 
tocathode is preserved and the intensified image 
is formed on a fluorescent screen. The image can 
then be registered, e.g. with a CCD camera. One 
of the advantages of the image intensifier is that 
even faint objects can be imaged using relatively 
short exposures, and observations can be made at 
wavelengths where the detector is insensitive. 

Another common type of detector is based on 
the TV camera ( Vidicon camera). The electrons 
released from the photocathode are accelerated 
with a voltage of a few kilovolts before they hit 
the electrode where they form an image in the 
form of an electric charge distribution. After ex¬ 
posure, the charge at different points of the elec¬ 
trode is read by scanning its surface with an elec¬ 
tron beam row by row. This produces a video sig¬ 
nal, which can be transformed into a visible im¬ 
age on a TV tube. The information can also be 
saved in digital form. In the most advanced sys¬ 
tems, the scintillations caused by single electrons 
on the fluorescent screen of the image intensi¬ 
fier can be registered and stored in the memory 
of a computer. For each point in the image there 
is a memory location, called a picture element or 
pixel. 


Since the middle of the 1970’s, detectors using 
semiconductor techniques began to be used in in¬ 
creasing numbers. With semiconductor detectors 
a quantum efficiency of about 70-80 % can be at¬ 
tained; thus, sensitivity cannot be improved much 
more. The wavelength regions suitable for these 
new detectors are much wider than in the case of 
the photographic plate. The detectors are also lin¬ 
ear: if the number of photons is doubled, also the 
signal will be doubled. Computers are used for 
collecting, saving and analysing the output data 
available in digital form. 

CCD Camera Currently the most important 
detector is the CCD camera (Charge Coupled De¬ 
vice). The detector consists of a surface made 
up of light sensitive silicon diodes, arranged in 
a rectangular array of image elements or pix¬ 
els. The largest cameras can have as many as 
4096 x 4096 pixels, although most are consid¬ 
erably smaller. Recently, CCD’s have become 
widely used in ordinary digital cameras. 

A photon hitting the detector can release 
an electron, which will remain trapped inside 
a pixel. After the exposure varying potential 
differences are used to move the accumulated 
charges row by row to a readout buffer. In the 
buffer the charges are moved pixel by pixel to 
an analogy/digital converter, which transmits the 
digital value to a computer. Reading an image 
also clears the detector (Fig. 3.24). If the expo¬ 
sures are very short the readout times may take 
a substantial part of the observing time. 

The CCD camera is nearly linear: the number 
of electrons is directly proportional to the num¬ 
ber of photons. Calibration of the data is there¬ 
fore much easier than with photographic plates 
(Fig. 3.25). 

The quantum efficiency, i.e. the number of 
electrons per incident photon, is high, and the 
CCD camera is much more sensitive than a pho¬ 
tographic plate. The sensitivity is highest in the 
red wavelength range, about 600-800 nm, where 
the quantum efficiency can be 80-90 % or even 
higher. 

The range of the camera extends far to the in¬ 
frared. In the ultraviolet the sensitivity drops due 
to the absorption of the silicon very rapidly be¬ 
low about 500 nm. Two methods have been used 
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Fig. 3.25 The principle of reading a CCD camera, 
(a) During an exposure electrons are trapped in potential 
wells corresponding to pixels of the camera. The number 
at each pixel shows the number of electrons, (b) After 
the exposure each horizontal line is moved one pixel to 
the right; the rightmost row moves to the readout buffer, 
(c) The contents of the buffer is moved down by one pixel. 

to avoid this problem. One is to use a coating that 
absorbs the ultraviolet photons and emits light of 
longer wavelength. Another possibility is to turn 
the chip upside down and make it very thin to re¬ 
duce the absorption. 

The thermal noise of the camera generates 
dark current even if the camera is in total dark¬ 
ness. To reduce the noise the camera must be 
cooled. Astronomical CCD cameras are usually 
cooled with liquid nitrogen, which efficiently re¬ 
moves most of the dark current. However, the 
sensitivity is also reduced when the camera is 
cooled; so too cold is not good either. The tem¬ 
perature must be kept constant in order to obtain 
consistent data. For amateurs there are already 
moderately priced CCD cameras, which are elec¬ 
trically cooled. Many of them are good enough 
also for scientific work, if very high sensitivity is 
not required. 

The dark current can easily be measured by 
taking exposures with the shutter closed. Sub¬ 
tracting this from the observed image gives the 
real number of electrons due to incident light 
(Fig. 3.26). 


The lowermost charge moves to the A/D converter, which 
sends the number of electrons to the computer, (d) Af¬ 
ter moving the buffer down several times one vertical row 
has been read, (e) The image is again shifted right by one 
pixel. This procedure is repeated till the whole image is 
read 



Fig. 3.26 The signal measured by a CCD-camera de¬ 
pends almost linearly on the number of photons. During 
the exposure also the dark current will increase and has to 
be subtracted from the observed values. Bias is the dark 
current registered even if the exposure time were zero 

The sensitivity of individual pixels may be 
slightly different. This can be corrected for by 
taking an image of an evenly illuminated field, 
like a twilight sky, where stars are not visible. 
This image is called a flat-field. When observa¬ 
tions are divided by the flat-field, the error caused 
by different pixels is removed (Fig. 3.27). 
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Fig. 3.27 Typical images related to CCD observations. Top left dark current, top right flat field, bottom left original 
raw image and bottom right reduced image 


The CCD camera is very stable. Therefore it is 
not necessary to repeat the dark current and flat- 
field observations very frequently. Typically these 
calibration exposures are taken during evening 
and morning twilights, just before and after ac¬ 
tual observations. 

Cosmic rays are charged particles that can pro¬ 
duce extraneous bright dots in CCD images. They 
are usually limited to one or two pixels, and are 
easily identified. Typically a short exposure of 
a few minutes contains a few traces of cosmic 


rays. Instead of a single long exposure it is usu¬ 
ally better to take several short ones, clean the im¬ 
ages from cosmic rays, and finally add the images 
on a computer. 

A more serious problem is the readout noise 
of the electronics. In the first cameras it could 
be hundreds of electrons per pixel. In modern 
cameras it is a few electrons. This gives a limit 
to the faintest detectable signal: if the signal is 
weaker than the readout noise, it is indistinguish¬ 
able from the noise. Nowadays there are so called 
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L3-cameras (low light level CCD), where elec¬ 
trons are multiplied during reading and the read¬ 
out noise is the only source of noise. 

Although the CCD camera is a very sensi¬ 
tive detector, even bright light cannot damage it. 
A photomultiplier, on the other hand, can be eas¬ 
ily destroyed by letting in too much light. How¬ 
ever, one pixel can only store a certain number of 
electrons, after which it becomes saturated. Ex¬ 
cessive saturation can make the charge to over¬ 
flow also to the neighbouring pixels. If the camera 
becomes badly saturated it may have to be read 
several times to completely remove the charges. 

The largest CCD cameras are quite expensive, 
and even they are still rather small compared 
with the largest photographic plates and films. 
Because they are very sensitive and processing 
of the data is easy, they have, however, replaced 
photographic materials almost completely. 

Spectrographs The simplest spectrograph is 
a prism that is placed in front of a telescope. 
This kind of device is called the objective prism 
spectrograph. The prism spreads out the different 
wavelengths of light into a spectrum which can 
be registered. During the exposure, the telescope 
is usually slightly moved perpendicularly to the 
spectrum, in order to increase the width of the 
spectrum. With an objective prism spectrograph, 
large numbers of spectra can be photographed, 
e.g. for spectral classification (Chap. 9). 

For more accurate information the slit spec¬ 
trograph must be used (Fig. 3.28). It has a nar¬ 
row slit in the focal plane of the telescope. The 
light is guided through the slit to a collimator 
that reflects or refracts all the light rays into 
a parallel beam. After this, the light is dispersed 
into a spectrum by a prism and focused with 
a camera onto a detector, which nowadays is usu¬ 
ally a CCD camera. A comparison spectrum is 
exposed next to the stellar spectrum to deter¬ 
mine the precise wavelengths. In modern spec¬ 
trographs using CCD cameras, the comparison 
spectrum is usually exposed as a separate im¬ 
age. A big slit spectrograph is often placed at the 
coude or Nasmyth focus of the telescope. 

The most important property of a spectrograph 
is the scale or dispersion of the spectrum. The dis- 


Slit Collimator Prism 



Fig. 3.28 The principle of the slit spectrograph. Light 
rays entering through a slit are collimated (made paral¬ 
lel to each other), dispersed into a spectrum by a prism 
and projected onto a photographic plate or a CCD. There 
can also be several prisms. However, grids are used more 
commonly since they give a better resolution 

persion tells how long a wavelength range cor¬ 
responds to one distance unit of the detector. 
For the objective prism the dispersion is typi¬ 
cally a few tens of nanometres per millimetre. Slit 
spectrographs may have a dispersion as high as 
1-0.01 nm/mm, which makes it possible to study 
even the shapes of individual spectral lines. Often 
the dispersion is given as a dimensionless quan¬ 
tity. For instance, the dispersion 1 nm/mm means 
that the wavelength scale has increased million¬ 
fold; thus it can be expressed as 10 6 . 

Instead of the prism a diffraction grating can 
be used to form the spectrum. A grating has nar¬ 
row grooves, side by side, typically several hun¬ 
dred per millimetre. When light is reflected by 
the walls of the grooves, the adjoining rays in¬ 
terfere with each other and give rise to spectra of 
different orders. There are two kinds of gratings: 
reflection and transmission gratings. In a reflec¬ 
tion grating no light is absorbed by the glass as in 
the prism or transmission grating. A grating usu¬ 
ally has higher dispersion, or ability to spread the 
spectrum, than a prism. The dispersion can be in¬ 
creased by increasing the density of the grooves 
of the grating. In slit spectrographs the reflection 
grating is most commonly used. In some instru¬ 
ments a grism is used; it is a prism with a trans¬ 
mission grating on one of its surfaces. 

Interferometers The resolution of a big tele¬ 
scope is in practice limited by seeing, and thus 
increasing the aperture does not necessarily im¬ 
prove the resolution. To get nearer to the theoret¬ 
ical resolution limit set by diffraction (Eq. (3.3), 
Fig. 3.6), different interferometers can be used. 
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There are two types of optical interferome¬ 
ters. One kind uses an existing large telescope; 
the other a system of two or more separate tele¬ 
scopes. In both cases the light rays are allowed 
to interfere. By analysing the outcoming interfer¬ 
ence pattern, the structures of close binaries can 
be studied, apparent angular diameters of the stars 
can be measured, etc. 

One of the earliest interferometers was the 
Michelson interferometer that was built shortly 
before 1920 for the largest telescope of that time. 
In front of the telescope, at the ends of a six me¬ 
tre long beam, there were flat mirrors reflecting 
the light into the telescope. The form of the inter¬ 
ference pattern changed when the separation of 
the mirrors was changed. In practice, the inter¬ 
ference pattern was disturbed by seeing, and only 
a few positive results were obtained with this in¬ 
strument. 

The diameters of over 30 of the brightest stars 
have been measured using intensity interferome¬ 
ters. Such a device consists of two separate tele¬ 
scopes that can be moved in relation to each other. 
This method is suitable for the brightest objects 
only. 

In 1970 the Frenchman Antoine Labeyrie in¬ 
troduced the principle of speckle interferometry. 
In traditional imaging the pictures from long ex¬ 
posures consist of a large number of instanta¬ 
neous images, “speckles”, that together form the 
seeing disk. In speckle interferometry very short 
exposures and large magnifications are used and 
hundreds of pictures are taken. When these pic¬ 
tures are combined and analysed (usually in dig¬ 
ital form), the actual resolution of the telescope 
can nearly be reached. 

The accuracy of interferometric techniques 
was improved at the beginning of 00’s. The first 
experiments to use the two 10 m Keck telescopes 
as one interferometer, were made in 2001. Simi¬ 
larly, the ESO VLT can be used as an interferom¬ 
eter. 


3.4 Radio Telescopes 

Radio astronomy, started in the 1930’s, extended 
the observable range of the electromagnetic spec¬ 
trum by several orders of magnitude. Radio fre¬ 


quencies range from a few megahertz (100 m) 
up to about 300 GHz (1 mm). The low-frequency 
limit of the radio band is determined by the opac¬ 
ity of the ionosphere, while the high-frequency 
limit is due to the strong absorption from oxy¬ 
gen and water bands in the lower atmosphere. 
Neither of these limits is very strict, and un¬ 
der favourable conditions radio astronomers can 
work into the submillimetre region or through 
ionospheric holes during sunspot minima. 

At the beginning of the 20th century attempts 
were made to observe radio emission from the 
Sun. These experiments, however, failed because 
of the low sensitivity of the antenna-receiver sys¬ 
tems, and because of the opaqueness of the iono¬ 
sphere at the low frequencies at which most of 
the experiments were carried out. The first ob¬ 
servations of cosmic radio emission were later 
made by the American engineer Karl G. Jansky 
in 1932, while studying thunderstorm radio dis¬ 
turbances at a frequency of 20.5 MHz (14.6 m). 
He discovered radio emission of unknown ori¬ 
gin, which varied within a 24 hour period. Some¬ 
what later he identified the source of this radi¬ 
ation to be in the direction of the centre of our 
Galaxy. 

The real birth of radio astronomy may perhaps 
be dated to the late 1930’s, when Grote Reber 
started systematic observations with his home¬ 
made 9.5 m paraboloid antenna. Thereafter ra¬ 
dio astronomy developed quite rapidly and has 
greatly improved our knowledge of the Universe. 

Observations are made both in the contin¬ 
uum (broad band) and in spectral lines (radio 
spectroscopy). Much of our knowledge about the 
structure of our Milky Way comes from radio ob¬ 
servations of the 21 cm line of neutral hydrogen 
and, more recently, from the 2.6 mm line of the 
carbon monoxide molecule. Radio astronomy has 
resulted in many important discoveries; e.g. both 
pulsars and quasars were first found by radio as¬ 
tronomical observations. The importance of the 
field can also be seen from the fact that the Nobel 
prize in physics has recently been awarded three 
times to radio astronomers. 

A radio telescope collects radiation in an aper¬ 
ture or antenna, from which it is transformed to 
an electric signal by a receiver, called a radiome- 
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ter. This signal is then amplified, detected and 
integrated, and the output is registered on some 
recording device, nowadays usually by a com¬ 
puter. Because the received signal is very weak, 
one has to use sensitive receivers. These are often 
cooled to minimise the noise, which could other¬ 
wise mask the signal from the source. Because ra¬ 
dio waves are electromagnetic radiation, they are 
reflected and refracted like ordinary light waves. 
In radio astronomy, however, only reflecting tele¬ 
scopes are used. 

At low frequencies the antennas are usually 
dipoles (similar to those used for radio or TV), 
but in order to increase the collecting area and 
improve the resolution, one uses dipole arrays, 
where all dipole elements are connected to each 
other. 

The most common antenna type, however, is 
a parabolic reflector, which works exactly as an 
optical mirror telescope. At long wavelengths the 
reflecting surface does not need to be solid, be¬ 
cause the long wavelength photons cannot see the 
holes in the reflector, and the antenna is there¬ 
fore usually made in the form of a metal mesh. 
At high frequencies the surface has to be smooth, 
and in the millimetre-submillimetre range, radio 
astronomers even use large optical telescopes, 
which they equip with their own radiometers. 
To ensure a coherent amplification of the signal, 
the surface irregularities should be less than one- 
tenth of the wavelength used. 

The main difference between a radio telescope 
and an optical telescope is in the recording of 
the signal. Radio telescopes are not imaging tele¬ 
scopes (except for synthesis telescopes, which 
will be described later); instead, a feed horn, 
which is located at the antenna focus, transfers 
the signal to a receiver. The wavelength and phase 
information is, however, preserved. 

The resolving power of a radio telescope, 9, 
can be deduced from the same formula (3.4) as 
for optical telescopes, i.e. X/D, where X is the 
wavelength used and D is the diameter of the 
aperture. Since the wavelength ratio between ra¬ 
dio and visible light is of the order of 10,000, ra¬ 
dio antennas with diameters of several kilometres 
are needed in order to achieve the same resolution 
as for optical telescopes. In the early days of radio 


astronomy poor resolution was the biggest draw¬ 
back for the development and recognition of radio 
astronomy. For example, the antenna used by Jan¬ 
sky had a fan beam with a resolution of about 30° 
in the narrower direction. Therefore radio obser¬ 
vations could not be compared with optical ob¬ 
servations. Neither was it possible to identify the 
radio sources with optical counterparts. 

The world’s biggest radio telescope is the 
Arecibo antenna in Puerto Rico, whose main re¬ 
flector has a diameter of 305 m. It is a fixed metal 
mesh built over a natural round valley (Fig. 3.29). 
In the late 1970’s the antenna surface and re¬ 
ceivers were upgraded, enabling the antenna to be 
used down to wavelengths of 5 cm. The mirror of 
the Arecibo telescope is not parabolic but spher¬ 
ical, and the antenna is equipped with a movable 
feed system, which makes observations possible 
within a 20° radius around the zenith. 

A similar and only partly steerable radio tele¬ 
scope is being built in China. It will be the largest 
in the world with an antenna diameter of 500 m. 

The biggest completely steerable radio tele¬ 
scope is the Green Bank telescope in Virginia, 
U.S.A., dedicated at the end of 2000. It is slightly 
asymmetric with a diameter of 100 x 110 m 
(Fig. 3.30). Before the Green Bank telescope, for 
over two decades the largest telescope was the 
Effelsberg telescope in Germany. This antenna 
has a parabolic main reflector with a diameter of 
100 m. The inner 80 m of the dish is made of 
solid aluminium panels, while the outmost por¬ 
tion of the disk is a metal mesh structure. By us¬ 
ing only the inner portion of the telescope, it has 
been possible to observe down to wavelengths of 
4 mm. The oldest and perhaps best-known big ra¬ 
dio telescope is the 76 m antenna at Jodrell Bank 
in Britain, which was completed in the end of the 
1950’s. 

The biggest telescopes are usually incapable 
of operating below wavelengths of 1 cm, because 
the surface cannot be made accurate enough. 
However, the millimetre range has become more 
and more important. In this wavelength range 
there are many transitions of interstellar mole¬ 
cules, and one can achieve quite high angular 
resolution even with a single dish telescope. At 
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Fig. 3.29 The largest radio telescope in the world is the Arecibo dish in Puerto Rico. It has been constructed over 
a natural bowl and is 300 m in diameter. (Photo Arecibo Observatory) 



Fig. 3.30 The largest fully steerable radio telescope is in Green Bank, Virginia. Its diameter is 100 x 110 m. (Photo 
NRAO) 


present, the typical size of a mirror of a mil- big millimetre telescopes are in operation (Ta- 
limetre telescope is about 15 m. The develop- ble C.24). Among them are the 40 m Nobeyama 
ment of this field is rapid, and at present several telescope in Japan, which can be used down to 
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Fig. 3.31 The Atacama Large Millimetre Array (ALMA) built in cooperation by Europe, U.S.A. and Japan. The 
telescope has 64 antennas. (Photo ESO/NOAJ) 


3 mm, the 30 m IRAM telescope at Pico Veleta 
in Spain, which is usable down to 1 mm, and the 
15 m. The largest project in the first decade of 
the 21st century is ALMA (Atacama Large Mil¬ 
limetre Array), which consists of 64 telescopes 
with a diameter of 12 m (Fig. 3.31). It was built 
as an international project by the United States, 
Europe and Japan in Chile on the Chajnantor 
plateau. 

As already mentioned, the resolving power of 
a radio telescope is far poorer than that of an op¬ 
tical telescope. The biggest radio telescopes can 
at present reach a resolution of 5 arc seconds, 
and that only at the very highest frequencies. To 
improve the resolution by increasing the size is 
difficult, because the present telescopes are al¬ 
ready close to the practical upper limit. However, 
by combining radio telescopes and interferome¬ 
ters, it is possible to achieve even better resolu¬ 
tion than with optical telescopes. 

As early as 1891 Michelson used an interfer¬ 
ometer for astronomical purposes. While the use 
of interferometers has proved to be quite difficult 
in the optical wavelength regime, interferometers 



Fig. 3.32 The principle of an interferometer. If the ra¬ 
diation reaches the radio telescopes in the same phase, 
the waves amplify each other and a maximum is obtained 
in the combined radiation (cases 1 and 3). If the incom¬ 
ing waves are in opposite phase, they cancel each other 
(case 2) 

are extremely useful in the radio region. To form 
an interferometer, one needs at least two anten¬ 
nas coupled together. The spacing between the 
antennas, D, is called the baseline. Let us first 
assume that the baseline is perpendicular to the 
line of sight (Fig. 3.32). Then the radiation ar¬ 
rives at both antennas with the same phase, and 













78 


3 Observations and Instruments 


the summed signal shows a maximum. However, 
due to the rotation of the Earth, the direction of 
the baseline changes, producing a phase differ¬ 
ence between the two signals. The result is a sinu¬ 
soidal interference pattern, in which minima oc¬ 
cur when the phase difference is 180 degrees. The 
distance between the peaks is given by 

0D = X, 

where 6 is the angle the baseline has turned and 
X is the wavelength of the received signal. The 
resolution of the interferometer is thus equal to 
that of an antenna with a linear size equal to D. 

If the source is not a point source, the radi¬ 
ation emitted from different parts of the source 
will have phase differences when it enters the an¬ 
tennas. In this case the minima of the interference 
pattern will not be zero, but will have some posi¬ 
tive value P m ; n . If we denote the maximum value 
of the interference pattern by P max , the ratio 

Pi nax ^min 

Pfnax + Pri in 

gives a measure of the source size (fringe visibil¬ 
ity). 

More detailed information about the source 
structure can be obtained by changing the base¬ 
line by changing the relative positions of the 
antennas. In principle it is possible to observe 
the same amount of details as with a single an¬ 
tenna with the same size as the area covered 
by the movable antennas in their different posi¬ 
tions. This way interferometry is transformed into 
a technique called aperture synthesis. 

The theory and techniques of aperture synthe¬ 
sis were developed particularly by the British as¬ 
tronomer Sir Martin Ryle. In Fig. 3.33 the prin¬ 
ciple of aperture synthesis is illustrated. If the 
telescopes are located on an east-west track, the 
spacing between them, projected onto the sky, 
will in 12 hours describe a circle or an ellipse, de¬ 
pending on the position of the source as the Earth 
rotates around its axis. If one varies the distance 
between the telescopes, one will get a series of 
circles or ellipses on the sky. As we can see from 
Fig. 3.29, one does not have to cover all the spac- 
ings between the telescopes, because any antenna 



Fig. 3.33 To illustrate the principle of aperture synthe¬ 
sis, let us consider an east-west oriented interferometer 
pointed towards the celestial north. Each antenna is iden¬ 
tical, has a diameter D and operates at a wavelength X. 
The minimum spacing between each antenna element is a , 
and the maximum spacing is 6a. In (a) there are only 
two antennas, A and B, displaced by the maximum spac¬ 
ing 6a. When the earth rotates, antennas A and B will, 
in the course of 12 hours, track a circle on the plane of 
the sky with a diameter X/(6a), the maximum resolution 
that can be achieved with this interferometer. In (b) the 
antenna C is added to the interferometer, thus providing 
two more baselines, which track the circles AC and BC 
with radii of X/(2a) and X/{4a), respectively. In (c) there 
is still another antenna D added to the interferometer. In 
this case two of the baselines are equal, AD and CD, and 
therefore only two new circles are covered on the plane 
of the sky. By adding more interferometer elements, one 
can fill in the missing parts within the primary beam, i.e. 
the beam of one single dish, and thus obtain a full cov¬ 
erage of the beam. The resolution of the system depends 
on the maximum separation of the antennas. It is evident 
from (c), that not all of the antenna positions are needed to 
provide all the different spacings; some antenna spacings 
will in such a case be equal and therefore provide no addi¬ 
tional information. It is essential to position the antennas 
in such a way that as many different baselines as possi¬ 
ble are formed. The observed values and the image of the 
target are related by a Fourier transform 


combination which has the same relative distance 
will describe the same path on the sky. In this way 
one can synthesise an antenna, a filled aperture, 
with a size equal to the maximum spacing be¬ 
tween the telescopes. Interferometers working ac¬ 
cording to this principle are called aperture syn- 
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Fig. 3.34 The VLA at Socorro, New Mexico, is a synthesis telescope consisting of 27 movable antennas 


thesis telescopes. If one covers all the spacings up 
to the maximum baseline, the result will be an ac¬ 
curate map of the source over the primary beam 
of an individual antenna element. Aperture syn¬ 
thesis telescopes therefore produce an image of 
the sky, i.e. a “radio photograph”. 

A typical aperture synthesis telescope consists 
of one fixed telescope and a number of mov¬ 
able telescopes, usually located on an east-west 
track, although T or Y configurations are also 
quite common. The number of telescopes used 
determines how fast one can synthesise a larger 
disk, because the number of possible antenna 
combinations increases as n(n — 1), where n is 
the number of telescopes. It is also possible to 
synthesise a large telescope with only one fixed 
and one movable telescope by changing the spac¬ 
ing between the telescopes every 12 hours, but 
then a full aperture synthesis can require several 
months of observing time. In order for this tech¬ 
nique to work, the source must be constant, i.e. 
the signal cannot be time variable during the ob¬ 
serving session. 

The most efficient aperture synthesis tele¬ 
scopes at present are the VLA (Very Large Array) 


in New Mexico, USA (Fig. 3.34) and ALMA in 
Chile. 

VLA consists of 27 paraboloid antennas, each 
with a diameter of 25 m, which are located on 
a Y-shaped track. The Y-formation was chosen 
because it provides a full aperture synthesis in 
8 hours. Each antenna can be moved by a spe¬ 
cially built carrier, and the locations of the tele¬ 
scopes are chosen to give optimal spacings for 
each configuration. In the largest configuration 
each arm is about 21 km long, thereby resulting 
in an antenna with an effective diameter of 35 km. 
If the VLA is used in its largest configuration 
and at its highest frequency, 23 GHz (1.3 cm), 
the resolution achieved is 0.1 arc second, clearly 
superior to any optical telescope. Similar resolu¬ 
tion can also be obtained with the British MER¬ 
LIN telescope, where already existing telescopes 
have been coupled together by radio links. Other 
well-known synthesis telescopes are the Cam¬ 
bridge 5 km array in Britain and the Westerbork 
array in the Netherlands, both located on east- 
west tracks. 

Even higher resolution can be obtained with 
an extension of the aperture synthesis technique, 
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called VLBI (Very Long Baseline Interferome¬ 
try). With the VLBI technique the spacing be¬ 
tween the antennas is restricted only by the size 
of the Earth. Some of the antennas can even be 
on satellites, in which case the baseline can be 
arbitrarily long. The antennas are all pointed to¬ 
wards the same source. The signal is recorded to¬ 
gether with accurate timing signals from atomic 
clocks. The data files are correlated against each 
other, resulting in maps similar to those obtained 
with a normal aperture synthesis telescope. With 
VLBI techniques it is possible to achieve resolu¬ 
tions of 0.00001" if one of the telescopes is or¬ 
biting the Earth. Because interferometry is very 
sensitive to the distance between the telescopes, 
the VLBI technique also provides one of the most 
accurate methods to measure distances. Currently 
one can measure distances with an accuracy of 
a few millimetres on intercontinental baselines. 
This is utilised in geodetic VLBI experiments, 
which study continental drift and polar motion as 
a function of time. 

In radio astronomy the maximum size of sin¬ 
gle antennas has also been reached. The trend 
is to build synthesis antennas. In the 1990’s The 
United States built a chain of antennas extending 
across the whole continent, and the Australians 
have constructed a similar, but north-south an¬ 
tenna chain across their country. 

More and more observations are being made 
in the submillimetre region. The disturbing effect 
of atmospheric water vapour becomes more se¬ 
rious at shorter wavelengths; thus, submillimetre 
telescopes must be located on mountain tops, like 
optical telescopes. All parts of the mirror are ac¬ 
tively controlled in order to accurately maintain 
the proper form like in the new optical telescopes. 
Several new submillimetre telescopes are under 
construction. 


3.5 Other Wavelength Regions 

The Earth receives electromagnetic radiation 
from the sky at all wavelengths However, as 
mentioned in Sect. 3.1, not all radiation reaches 
the ground. The wavelength regions absorbed by 
the atmosphere have only been studied more ex¬ 
tensively since the 1970’s, using Earth-orbiting 


satellites (Fig. 3.37). Besides the optical and ra¬ 
dio regions, there are only some narrow wave¬ 
length ranges in the infrared that can be observed 
from high mountain tops. 

The first observations in each new wavelength 
region were usually carried out from balloons, but 
not until rockets came into use could observa¬ 
tions be made from outside the atmosphere. The 
first actual observations of an X-ray source, for 
instance, were made on a rocket flight in June 
1962, when the detector rose above the atmo¬ 
sphere for about 6 minutes. Finally, Earth orbiting 
satellites have made it possible to map the whole 
sky in the wavelength regions invisible from the 
ground. 

Gamma Radiation Gamma ray astronomy 
studies radiation quanta with energies of 
10 5 -10 14 eV. The boundary between gamma and 
X-ray astronomy, 10 5 eV, corresponds to a wave¬ 
length of 10“ 11 m. The boundary is not fixed; the 
regions of hard (= high-energy) X-rays and soft 
gamma rays partly overlap. 

While ultraviolet, visible and infrared radia¬ 
tion are all produced by changes in the energy 
states of the electron envelopes of atoms, gamma 
and hard X-rays are produced by transitions in 
atomic nuclei or in mutual interactions of ele¬ 
mentary particles. Thus observations of the short¬ 
est wavelengths give information on processes 
different from those giving rise to longer wave¬ 
lengths. 

The first observations of gamma sources were 
obtained at the end of the 1960’s, when a device 
in the OSO 3 satellite (Orbiting Solar Observa¬ 
tory) detected gamma rays from the Milky Way. 
Later on, many satellites have been especially 
designed for gamma astronomy. The most re¬ 
cent satellites include the Compton Gamma Ray 
Observatory, operating in 1991-2000, the Euro¬ 
pean Integral, launched in 2002, and the Ameri¬ 
can Fermi, originally called GLAST. 

The quanta of gamma radiation have energies 
a million times greater than those of visible light, 
and therefore they cannot be observed with sim¬ 
ilar detectors. Observations are made with vari¬ 
ous scintillation detectors , usually composed of 
several layers of detector plates, where gamma 
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Fig. 3.35 The MAGIC telescope on La Palma in the Ca¬ 
nary islands observes the Cherenkov radiation produced 
by gamma rays. (Photo Robert Wagner, Max Planck Insti- 
tut fur Physik) 

radiation is transformed by the photoelectric ef¬ 
fect into visible light, detectable by photomulti¬ 
pliers. 

The energy of a gamma quantum can be deter¬ 
mined from the depth to which it penetrates the 
detector. Analysing the trails left by the quanta 
gives information on their approximate direction. 
The field of view is limited by the grating. The 
directional accuracy is low, and in gamma astron¬ 
omy the resolution is still far below that of other 
wavelength regions. 

Gamma radiation cannot pass through the at¬ 
mosphere, yet it has effects that can be observed 
with ground based instruments. When a high- 
energy photon enters the atmosphere it can cre¬ 
ate particle-antiparticle pairs. These particles can 
get such a high energy that they move faster than 
the local speed of light in the atmosphere. Such 
particles radiate Cherenkov radiation that can be 
detected as visible light (Fig. 3.35). 

X-rays The observational domain of X-ray as¬ 
tronomy includes the energies between 10 2 and 
10 5 eV, or the wavelengths 10-0.01 nm. The re¬ 
gions 10-0.1 nm and 0.1-0.01 nm are called soft 


and hard X-rays, respectively. X-rays were dis¬ 
covered in the late 19th century. Systematic stud¬ 
ies of the sky at X-ray wavelengths only became 
possible in the 1970’s with the advent of satellite 
technology. 

The first all-sky mapping was made in the 
early 1970’s by SAS 1 (Small Astronomical 
Satellite), also called Uhuru. At the end of the 
1970’s, two High-Energy Astronomy Observa¬ 
tories, HEAO 1 and 2 (the latter called Einstein), 
mapped the sky with much higher sensitivity than 
Uhuru. 

The Einstein Observatory was able to detect 
sources about a thousand times fainter than ear¬ 
lier X-ray telescopes. In optical astronomy, this 
would correspond to a jump from a 15 cm reflec¬ 
tor to a 5 m telescope. Thus X-ray astronomy has 
developed in 20 years as much as optical astron¬ 
omy in 300 years. 

The latest X-ray satellites have been the Amer¬ 
ican Chandra and the European XMM-Newton, 
both launched in 1999. 

Besides satellites mapping the whole sky, 
there have been several satellites observing the X- 
ray radiation of the Sun. The most recent ones are 
the Japanese Yohkoh and Hinode and the Ameri¬ 
can RHESSI. 

The first X-ray telescopes used detectors sim¬ 
ilar to those in gamma astronomy. Their di¬ 
rectional accuracy was never better than a few 
arc minutes. The more precise X-ray telescopes 
utilise the principle of grazing reflection 
(Fig. 3.36). An X-ray hitting a surface perpendic¬ 
ularly is not reflected, but absorbed. If, however, 
X-rays meet the mirror nearly parallel to its sur¬ 
face, just grazing it, a high quality surface can 
reflect the ray. 

The mirror of an X-ray reflector is on the in¬ 
ner surface of a slowly narrowing cone. The outer 
part of the surface is a paraboloid and the inner 
part a hyperboloid. The rays are reflected by both 
surfaces and meet at a focal plane. In practice, 
several tubes are installed one within another. For 
instance, the four cones of the Einstein Obser¬ 
vatory had as much polished optical surface as 
a normal telescope with a diameter of 2.5 m. The 
resolution in X-ray telescopes is of the order of 
a few arc seconds and the field of view about 
1 deg. 
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Fig. 3.36 X-rays are not reflected by an ordinary mirror, 
and the principle of grazing reflection must be used for 
collecting them. Radiation meets the paraboloid mirror at 
a very small angle, is reflected onto a hyperboloid mirror 

The detectors in X-ray astronomy are usually 
Geiger-Miiller counters, proportional counters 
or scintillation detectors. Geiger-Miiller and pro¬ 
portional counters are boxes filled with gas. The 
walls form a cathode, and an anode wire runs 
through the middle of the box; in more accu¬ 
rate counters, there are several anode wires. An 
X-ray quantum entering the box ionises the gas, 
and the potential difference between the anode 
and cathode gives rise to a current of electrons 
and positive ions. In recent imaging instruments 
(like the ACIS in Chandra) also CCD cameras are 
used. 

Ultraviolet Radiation Between X-rays and the 
optical region lies the domain of ultraviolet radi¬ 
ation, with wavelengths between 10 and 400 nm. 
Most ultraviolet observations have been carried 
out in the soft UV region, at wavelengths near 
those of optical light, since most of the UV radi¬ 
ation is absorbed by the atmosphere. The wave¬ 
lengths below 300 nm are completely blocked 
out. The short wavelength region from 10 to 
91.2 nm is called the extreme ultraviolet (EUV, 
XUV). 

Extreme ultraviolet was one of the last regions 
of the electromagnetic radiation to be observed 
systematically. The reason for this is that the ab¬ 
sorption of interstellar hydrogen makes the sky 
practically opaque at these wavelengths. The vis¬ 
ibility in most directions is limited to some hun¬ 
dred light years in the vicinity of the Sun. In some 
directions, however, the density of the interstellar 


and further to a focal point. In practice, several mirrors are 
placed one inside another, collecting radiation in a com¬ 
mon focus 


gas is so low that even extragalactic objects can 
be seen. 

The first dedicated EUV satellite was the Ex¬ 
treme Ultraviolet Explorer (EUVE), operating in 
1992-2000. It observed about a thousand EUV 
sources. In EUV grazing reflection telescopes 
similar to those used in X-ray astronomy are em¬ 
ployed. 

In nearly all branches of astronomy important 
information is obtained by observations of ultra¬ 
violet radiation. Many emission lines from stel¬ 
lar chromospheres or coronas, the Lyman lines of 
atomic hydrogen, and most of the radiation from 
hot stars are found in the UV domain. In the near- 
ultraviolet, telescopes can be made similar to op¬ 
tical telescopes and, equipped with a photome¬ 
ter or spectrometer, installed in a satellite orbiting 
the Earth. 

The most effective satellites in the UV have 
been the European TD-1, the American Orbit¬ 
ing Astronomical Observatories OAO 2 and 3 
(Copernicus), and the International Ultraviolet 
Explorer IUE. The instruments of the TD-1 satel¬ 
lite included both a photometer and a spectrome¬ 
ter. The satellite measured the magnitudes of over 
30,000 stars in four different spectral regions be¬ 
tween 135 and 274 nm, and registered UV spec¬ 
tra from over 1000 stars. The OAO satellites were 
also used to measure magnitudes and spectra, and 
OAO 3 worked for over eight years. 

The IUE satellite, launched in 1978, was one 
of the most successful astronomical satellites. 
IUE had a 45 cm Ritchey-Chretien telescope with 
an aperture ratio of //15 and a field of view of 
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Fig. 3.37 (a) The European X-ray satellite XMM- far ultraviolet objects from Earth orbit since 1999. (Graph- 
Newton was launched in 1999. (Drawing D. Ducros, ics NASA/JHU Applied Physics Laboratory) 

XMM Team, ESA.) (b) FUSE satellite has photographed 


16 arc minutes. The satellite had two spectro¬ 
graphs to measure spectra of higher or lower res¬ 
olution in wavelength intervals of 115-200 nm 


or 190-320 nm. For registration of the spec¬ 
tra, a Vidicon camera was used. 1UE was differ¬ 
ent from earlier satellites, since it could be used 
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almost like ground based telescopes. The ob¬ 
server could follow the observations and change 
them in real time. IUE worked on the orbit for 
20 years. 

The Hubble space telescope can also make 
observations in the ultraviolet. Some ultraviolet 
telescopes have also been on shuttle flight in the 
1990’s. 

Infrared Radiation Radiation with longer 
wavelengths than visible light is called infrared 
radiation. This region extends from about 1 mi¬ 
crometre to 1 millimetre, where the radio region 
begins. Sometimes the near-infrared, at wave¬ 
lengths below 5 pm, and the submillimetre do¬ 
main, at wavelengths between 0.1 and 1 mm, are 
considered separate wavelength regions. 

In infrared observations radiation is collected 
by a telescope, as in the optical region. The in¬ 
coming radiation consists of radiation from the 
object, from the background and from the tele¬ 
scope itself. Both the source and the background 
must be continually measured, the difference giv- 



Fig. 3.38 Refractors are not suitable for infrared tele¬ 
scopes, because infrared radiation cannot penetrate glass. 
The Cassegrain reflectors intended especially for infrared 
observations have secondary mirrors nodding rapidly back 
and forth between the object and the background near the 
object. By subtracting the brightness of the background 
from the brightness of the object, the background can be 
eliminated 


ing the radiation from the object. The background 
measurements can be made with a Cassegrain 
secondary mirror oscillating between the source 
and the background at a rate of, say, 100 oscil¬ 
lations per second, and thus the changing back¬ 
ground can be eliminated (Fig. 3.38). To register 
the measurements, semiconductor detectors are 
used. The detector must always be cooled to min¬ 
imise its own thermal radiation. Sometimes the 
whole telescope is cooled. 

Infrared observatories have been built on high 
mountain tops, where most of the atmospheric 
water vapour remains below. Some favourable 
sites are, e.g. Mauna Kea on Hawaii, Mount 
Lemon in Arizona and Pico del Teide on Tenerife. 
For observations in the far-infrared these moun¬ 
tains are not high enough; these observations are 
carried out, e.g. on aeroplanes. One of the best- 
equipped planes is the Kuiper Airborne Observa¬ 
tory KAO, named after the well-known planetary 
scientist Gerard Kuiper. 

Balloons and satellites are also used for in¬ 
frared observations. The first powerful infrared 
observatory was the InfraRed Astronomy Satel¬ 
lite IRAS, built by the United States and Holland 
and launched in 1983. It made observations for 
eight months and mapped the sky at four wave¬ 
lengths (12, 25, 60 and 100 pm), and found over 
200,000 new infrared sources. 

The European Infrared Space Observatory 
ISO, made more detailed observations of thou¬ 
sands of infrared objects in 1996-1998. Since 
2003 Spitzer (originally SIRTF, Space InfraRed 
Telescope Facility), has been mapping the in¬ 
frared sky. In 2009 ESA launched its own Her- 
schel satellite (Fig. 3.39), which operated till 
2013, when the liquid helium used for cooling 
was exhausted. 

Infrared satellites have also been used to map 
the afterglow of the big bang of the universe. 
A very successful satellite was the 1989 launched 
COBE (Cosmic Background Explorer), which 
mapped the background radiation in submillime¬ 
tre and infrared wavelengths. The Microwave 
Anisotropy Probe (MAP) has continued the work 
of COBE, starting in 2001. In 2009 the even 
more powerful European Planck satellite was 
launched. 
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Fig. 3.39 The European 
Herschel launched in 2009 
was the most efficient 
recent infrared satellite. 

The diameter of its main 
mirror, 3.5 metres, was 
bigger than in any earlier 
astronomical satellite. The 
final shape to the highly 
curved mirror was given by 
the Opteon company at the 
Tuorla observatory in 
Finland. The observations 
ended in 2103 when the 
liquid helium used to cool 
down the camera ran out. 
(Photos ESA, Rami 
Rekola) 



3.6 Other Forms of Energy 

Besides electromagnetic radiation, energy arrives 
from space in other forms: particles (cosmic rays, 
neutrinos) and gravitational radiation. 

Cosmic Rays Cosmic rays, consisting of elec¬ 
trons and totally ionised nuclei of atoms, are re¬ 
ceived in equal amounts from all directions. Their 
incoming directions do not reveal their origin, 


since cosmic rays are electrically charged; thus 
their paths are continually changed when they 
move through the magnetic fields of the Milky 
Way. The high energies of cosmic rays mean that 
they have to be produced by high-energy phe¬ 
nomena like supernova explosions. The major¬ 
ity of cosmic rays are protons (nearly 90 %) 
and helium nuclei (10 %), but some are heav¬ 
ier nuclei; their energies lie between 10 s and 
10 20 eV. 
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Fig. 3.39 (Continued) 



The most energetic cosmic rays give rise to 
secondary radiation , mainly muons, when they 
hit molecules of the atmosphere. This secondary 
radiation can be observed from the ground. Just 
like particles produces by gamma rays particles 
of the secondary radiation can move faster than 
light in the atmosphere emitting Cerenkov radi¬ 
ation. Thus the cosmic rays can also be detected 
by their Cerenkov radiation. 

The Pierre Auger telescope in Argentina ob¬ 
serves both secondary particles and Cerenkov ra¬ 
diation to detect cosmic rays with energies at least 
of the order of 10 18 eV. Since only about one 
such particle hits an area of one square kilometre 
in a century, the detectors have been positioned in 
an area of 3000 square kilometres. 

Primary cosmic rays can only be directly ob¬ 
served outside the atmosphere. The detectors 
used to observe cosmic rays are similar to those 
used in particle physics. Since Earth-based accel¬ 
erators reach energies of only about 10 12 eV, cos¬ 
mic rays offer an excellent “natural” laboratory 
for particle physics. Many satellites and space¬ 
craft have detectors for cosmic rays. 

Neutrinos Neutrinos are elementary particles 
with no electric charge. Earlier it was thought that 


they are massless, but currently they seem to have 
a tiny rest mass. They interact with other mat¬ 
ter only through the weak nuclear force, which 
is very much weaker than the electromagnetic or 
strong nuclear force. 

Most neutrinos are produced in nuclear reac¬ 
tions within stars and in supernova explosions. 
Since they react very weakly with other matter, 
they escape directly from the stellar interior. 

Due to their weak interaction neutrinos are 
very difficult to observe; trillions of them pass 
through this page every second without any no¬ 
ticeable consequences. 

The first method of detection was the radio¬ 
chemical method. As a reactive agent, e.g. tetra- 
chloroethene (C 2 CI 4 ) can be used. When a neu¬ 
trino hits a chlorine atom, the chlorine is trans¬ 
formed into argon, and an electron is freed: 

37 Cl+v-* 37 Ar+e“. 

The argon atom is radioactive and can be ob¬ 
served. Instead of chlorine, lithium and gallium 
might be used to detect neutrinos. The first gal¬ 
lium detectors have been running in Italy and 
Russia from the end of the 1980’s. 

Another observation method is based on the 
Cerenkov radiation produced by neutrinos in ex- 
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Fig. 3.40 The Sudbury neutrino observatory (SNO) is at 
the depth of two kilometres in a still operational mine. The 
spherical vessel contains liquid where neutrinos produce 
Cherenkov radiation. 9600 photomultipliers are used to 
register the flashes of the radiation. Earlier the liquid was 
heavy water but that was later replaced with alkyl benzene. 
The observations have already solved the solar neutrino 
problem (Sect. 13.1). The new SNO+ will be able to ob¬ 
serve neutrinos of even lower energies. (Photos SNOlab, 
H. Karttunen) 


tremely pure water. The flashes of light are regis¬ 
tered with photomultipliers, and thus it is possi¬ 
ble to find out the direction of the radiation. This 
method is used e.g. in the Japanese Kamiokande 
detector and in the Sudbury neutrino observatory 
in Canada (Fig. 3.40). 

The largest neutrino detector is the IceCube in 
the Antarctic, finished in 2010. It consists of one 
cubic kilometre of ice. 

Neutrino detectors must be located deep under 
the ground to protect them from the secondary 
radiation caused by cosmic rays. Also radioactive 
decay inside the Earth cases background noise. 

The detectors have observed neutrinos from 
the Sun, and the Supernova 1987A in the Large 
Magellanic Cloud was also observed in 1987. In 
the future, also the dark matter and other neutrino 
producing phenomena will be studied. 

Gravitational Radiation Gravitational astron¬ 
omy is as young as neutrino astronomy. The 
first attempts to measure gravitational waves were 
made in the 1960’s. Gravitational radiation is 
emitted by accelerating masses, just as electro¬ 
magnetic radiation is emitted by electric charges 
in accelerated motion. Detection of gravitational 
waves is very difficult, and they have been ob¬ 
served only very recently. 

The first type of gravitational wave antenna 
was the Weber cylinder. It is an aluminium cylin¬ 
der which starts vibrating at its proper frequency 
when hit by a gravitational pulse. The distance 
between the ends of the cylinder changes by 
about 10“ 17 m, and the changes in the length are 
studied by strain sensors welded to the side of the 
cylinder. 

Another type of modern gravity radiation de¬ 
tectors measures “spatial strain” induced by grav¬ 
ity waves and consists of two sets of mirrors in 
directions perpendicular to each other (Michel- 
son interferometer), or one set of parallel mir¬ 
rors (Fabry-Perot interferometer). The relative 
distances between the mirrors are monitored by 
laser interferometers. If a gravity pulse passes the 
detector, the distances change and the changes 
can be measured. The longest baseline between 
the mirrors is in the American LIGO (Laser Inter- 
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Fig. 3.41 The LIGO Livingston Observatory seen from 
the air. (Photo LIGO/Caltech) 


ferometer Gravitational-wave Observatory) sys¬ 
tem (Fig. 3.41). LIGO made the first scientific 
observations in 2002. In early 2016 the first pos¬ 
itive detection of gravitational waves was an¬ 
nounced. The observation was made in Septem¬ 
ber 14, 2015. 

Although the gravitational detectors are iso¬ 
lated from their surroundings as carefully as pos¬ 
sible, many perturbations can make them oscil¬ 
late. A positive detection requires that the same 
phenomenon is observed by different instruments 
located far away from each others. 


Box 3.1 (Diffraction by a Circular Aperture) 
Consider a circular hole of radius R in the 
xy plane. Coherent light enters the hole from 
the direction of the negative z axis (see fig¬ 
ure). We consider light rays leaving the hole 
parallel to the xz plane forming an angle 6 
with the z axis. The light waves interfere on 
a screen far away. The phase difference be¬ 
tween a wave through a point (x, y) and a wave 
going through the centre of the hole can be 
calculated from the different path lengths s = 
x sin 0 : 


8 = 


s 2 tc sin 0 

-2jt = - x = kx. 

X X 



Thus, the phase difference 8 depends on the 
x coordinate only. The sum of the amplitudes 
of the waves from a small surface element is 
proportional to the area of the element dxdy. 
Let the amplitude coming through the centre of 
the hole be d«o = dx d yi. The amplitude com¬ 
ing from the point (x, y) is then 

da = dx dy (cos Si + sin 8j ). 



da 0 


We sum up the amplitudes coming from dif¬ 
ferent points of the hole: 


« = / 

JA 


da 


Aperture 


r f 

Jx=—R Jy 


x/r 2 -x 2 

=-s/R 2 -x 


. (cos kx i + sin kx j ) dy dr 


= 2 / x/R- — x 2 (cos kx i + sin kx j ) dx. 
J-R 


Since sine is an odd function (sin(— kx) — 
— sin(£x)), we get zero when we integrate the 
second term. Cosine is an even function, and so 


a oc 



x 2 cos kx dx. 
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We substitute x = Rt and define p = kR — 
(2 7tr sind)/X, thus getting 


a oc 


/Vi ^ 2 

Jo 


t 2 cos pt dr. 


The zero points of the intensity observed on 
the screen are obtained from the zero points of 
the amplitude. 


J(p) 


-/'VT 


t 2 cos pt dr = 0. 


Inspecting the function J (p), we see that the 
first zero is at p — 3.8317, or 

2rr R sin 6 

-= 3.8317. 

X 

The radius of the diffraction disk in angular 
units can be estimated from the condition 


sin0 = 


3.83m 

2nR 


X 

1 . 22 —, 

D 


where D = 2R is the diameter of the hole. 

In mirror telescopes diffraction is caused 
also by the support structure of the secondary 
mirror. If the aperture is more complex and 
only elementary mathematics is used calcula¬ 
tions may become rather cumbersome. How¬ 
ever, it can be shown that the diffraction pattern 
can be obtained as the Fourier transform of the 
aperture. 


3.7 Examples 

Example 3.1 The distance between the compo¬ 
nents of the binary star £ Herculis is 1.38". What 
should the diameter of a telescope be to resolve 
the binary? If the focal length of the objective is 
80 cm, what should the focal length of the eye¬ 
piece be to resolve the components, when the res¬ 
olution of the eye is 2! ? 

In the optical region, we can use the wave¬ 
length value of X rs 550 nm. The diameter of the 
objective is obtained from the equation for the 


resolution (3.4), 

„^X_ 550 x 10 -9 

~ 6 ~ (1.38/3600) x (tt/180) 

= 0.08 m = 8 cm. 


The required magnification is 
2 ' 

co= -= 87. 

1.38" 

The magnification is given by 

/ 

CO = —, 

r 


and, thus, the focal length of the eyepiece should 
be 

, f 80 cm 

f =-= -= 0.9 cm. 

J w 87 


Example 3.2 A telescope has an objective with 
a diameter of 90 mm and focal length of 1200 mm. 


(a) What is the focal length of an eyepiece, the 
exit pupil of which is 6 mm (about the size of 
the pupil of the eye)? 

(b) What is the magnification of such an eye¬ 
piece? 

(c) What is the angular diameter of the Moon 
seen through this telescope and eyepiece? 

(a) From Fig. 3.7 we get 



whence 


/' = /— = 1200 mm 

j j D 


6 mm 
90 mm 


= 80 mm. 


(b) The magnification is co — f/f' = 1200 mm/ 
80 mm = 15. 

(c) Assuming the angular diameter of the Moon 
is a — 3T = 0.52°, its diameter through the 
telescope is coa — 7.8°. 
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3.8 Exercises 

Exercise 3.1 The Moon was photographed with 
a telescope, the objective of which had a diameter 
of 20 cm and focal length of 150 cm. The expo¬ 
sure time was 0.1s. 

(a) What should the exposure time be, if the di¬ 
ameter of the objective were 15 cm and focal 
length 200 cm? 

(b) What is the size of the image of the Moon in 
both cases? 


(c) Both telescopes are used to look at the Moon 
with an eyepiece the focal length of which is 
25 mm. What are the magnifications? 

Exercise 3.2 The radio telescopes at Amherst, 
Massachusetts, and Onsala, Sweden, are used as 
an interferometer, the baseline being 2900 km. 

(a) What is the resolution at 22 GHz in the direc¬ 
tion of the baseline? 

(b) What should be the size of an optical tele¬ 
scope with the same resolution? 
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Most astronomical observations utilise electro¬ 
magnetic radiation in one way or another. We can 
obtain information on the physical nature of a ra¬ 
diation source by studying the energy distribution 
of its radiation. We shall now introduce some ba¬ 
sic concepts that characterise electromagnetic ra¬ 
diation. 


4.1 Intensity, Flux Density and 
Luminosity 

Let us assume we have some radiation passing 
through a surface element d A (Fig. 4.1). Some of 
the radiation will leave dA within a solid angle 
d a>; the angle between dco and the normal to the 
surface is denoted by 0. The amount of energy 
with frequency in the range [v, v + dv] entering 
this solid angle in time dr is 

dE v — I v cos# dA dv dcodt. (4.1) 

Here, the coefficient I v is the specific intensity 
of the radiation at the frequency v in the di¬ 
rection of the solid angle da>. Its dimension is 
W m~ 2 Hz -1 sterad -1 . 

The projection of the surface element dA as 
seen from the direction 0 is d,4„ = dA cos/9, 
which explains the factor cos 0. If the intensity 
does not depend on direction, the energy d E v is 
directly proportional to the surface element per¬ 
pendicular to the direction of the radiation. 

The intensity including all possible frequen¬ 
cies is called the total intensity I , and is obtained 



Fig. 4.1 The intensity /„ of radiation is related to the en¬ 
ergy passing through a surface element dA into a solid 
angle dtu, in a direction 0 


by integrating I v over all frequencies: 



More important quantities from the observa¬ 
tional point of view are the energy flux ( L v , L) 
or, briefly, the flux and the flux density ( F v , F). 
The flux density gives the power of radiation per 
unit area; hence its dimension is Wm _2 Hz _1 
or W m -2 , depending on whether we are talking 
about the flux density at a certain frequency or 
about the total flux density. 

Observed flux densities are usually rather 
small, and W nr 2 would be an inconveniently 
large unit. Therefore, especially in radio astron- 
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omy, flux densities are often expressed in Jan- 
skys\ one Jansky (Jy) equals 10 -26 W m 2 Hz -1 . 

When we are observing a radiation source, we 
in fact measure the energy collected by the de¬ 
tector during some period of time, which equals 
the flux density integrated over the radiation¬ 
collecting area of the instrument and the time in¬ 
terval. 

The flux density F v at a frequency v can be 
expressed in terms of the intensity as 

F v = --- [ d E v 

dA dv dr J s 

= J 7yCos(9d&), (4.2) 

where the integration is extended over all possible 
directions. Analogously, the total flux density is 

F — J 7cos0d&>. 

For example, if the radiation is isotropic, i.e. if 
7 is independent of the direction, we get 

F= 7cos(?d&> = 7 / cos(?d&>. (4.3) 

Js Js 

The solid angle element dco is equal to a surface 
element on a unit sphere. In spherical coordinates 
it is (Fig. 4.2; also cf. Appendix A. 5): 

da> = sinf? dd dtp. 

Substitution into (4.3) gives 

/»7r /»2 7T 

F = I I cos 6 sin 6 dO d(f> = 0, 

Je=oJtp=o 

so there is no net flux of radiation. This means 
that there are equal amounts of radiation entering 
and leaving the surface. If we want to know the 
amount of radiation passing through the surface, 
we can find, for example, the radiation leaving the 
surface. For isotropic radiation this is 

/*7t/2 p 2 tt 

F\ = I / cost? sinf? d9 dtp = jtF (4.4) 
Jg = o J</> = o 

In the astronomical literature, terms such as 
intensity and brightness are used rather vaguely. 



Fig. 4.2 An infinitesimal solid angle da; is equal to 
the corresponding surface element on a unit sphere: 
da> = sin 8 d 8 dtp 

Flux density is hardly ever called flux density 
but intensity or (with luck) flux. Therefore the 
reader should always carefully check the mean¬ 
ing of these terms. 

Flux means the power going through some 
surface, expressed in watts. The flux emitted by 
a star into a solid angle &> is L = cor 2 F, where 
F is the flux density observed at a distance r. To¬ 
tal flux is the flux passing through a closed sur¬ 
face encompassing the source. Astronomers usu¬ 
ally call the total flux of a star the luminosity L. 
We can also talk about the luminosity L v at a fre¬ 
quency v ([7.y] = WHz -1 ). (This must not be 
confused with the luminous flux used in physics; 
the latter takes into account the sensitivity of the 
eye.) 

If the source (like a typical star) radiates 
isotropically, its radiation at a distance r is dis¬ 
tributed evenly on a spherical surface whose area 
is 4rtr 2 (Fig. 4.3). If the flux density of the radi¬ 
ation passing through this surface is F, the total 
flux is 

L = 4-7tr 2 F. (4.5) 

If we are outside the source, where radiation is 
not created or destroyed, the luminosity does not 
depend on distance. The flux density, on the other 
hand, falls off proportional to 1 /r 2 . 
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For extended objects (as opposed to objects 
such as stars visible only as points) we can de¬ 
fine the surface brightness as the flux density per 
unit solid angle (Fig. 4.4). Now the observer is at 
the apex of the solid angle. The surface bright¬ 
ness is independent of distance, which can be un¬ 
derstood in the following way. The flux density 
arriving from an area A is inversely proportional 
to the distance squared. But also the solid an¬ 
gle subtended by the area A is proportional to 
1 /r 2 ( a) — A/r 2 ). Thus the surface brightness 
B — FIco remains constant. 



Fig. 4.3 An energy flux which at a distance r from a point 
source is distributed over an area A is spread over an 
area 4A at a distance 2 r. Thus the flux density decreases 
inversely proportional to the distance squared 


The energy density u of radiation is the amount 
of energy per unit volume (J m -3 ): 

u = - [ I dcu. (4.6) 

c Js 

This can be seen as follows. Suppose we have ra¬ 
diation with intensity I arriving from a solid an¬ 
gle d&> perpendicular to the surface dA (Fig. 4.5). 
In the time dr, the radiation travels a distance c dr 
and fills a volume dV = cdr dA. Thus the energy 
in the volume dV is (now cos# =1) 

1 

d E = /dAdtudr = -I dtwdV. 

c 

Hence the energy density d u of the radiation ar¬ 
riving from the solid angle d&> is 

d E 1 

d u = — = — Ida), 
dV c 

and the total energy density is obtained by inte¬ 
grating this over all directions. For isotropic radi¬ 
ation we get 



c 


4.2 Apparent Magnitudes 

As early as the second century B.C., Hipparchos 
divided the visible stars into six classes according 
to their apparent brightness. The first class con¬ 
tained the brightest stars and the sixth the faintest 
ones still visible to the naked eye. 


Fig. 4.4 An observer sees 
radiation coming from 
a constant solid angle w. 
The area giving off 
radiation into this solid 
angle increases when the 
source moves further away 
(A cx. r 2 ). Therefore the 
surface brightness or the 
observed flux density per 
unit solid angle remains 
constant 
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Fig. 4.5 In time df, the radiation fills a volume 
dV = cdt dA, where d A is the surface element perpen¬ 
dicular to the propagation direction of the radiation 

The response of the human eye to the bright¬ 
ness of light is not linear. If the flux densities 
of three stars are in the proportion 1:10:100, the 
brightness difference of the first and second star 
seems to be equal to the difference of the second 
and third star. Equal brightness ratios correspond 
to equal apparent brightness differences: the hu¬ 
man perception of brightness is logarithmic. 

The rather vague classification of Hipparchos 
was replaced in 1856 by Norman R. Pogson. The 
new, more accurate classification followed the old 
one as closely as possible, resulting in another of 
those illogical definitions typical of astronomy. 
Since a star of the first class is about one hun¬ 
dred times brighter than a star of the sixth class, 
Pogson defined the ratio of the brightnesses of 
classes n and n + 1 as <J 100 = 2.512. 

The brightness class or magnitude can be de¬ 
fined accurately in terms of the observed flux den¬ 
sity F ([/■’] = W m 2 ). We decide that the magni¬ 
tude 0 corresponds to some preselected flux den¬ 
sity Fq. All other magnitudes are then defined by 
the equation 


m = - 2.5 lg£. (4.8) 

Fo 

Note that the coefficient is exactly 2.5, not 2.512! 
Magnitudes are dimensionless quantities, but to 
remind us that a certain value is a magnitude, we 
can write it, for example, as 5 mag or 5 m . 

It is easy to see that (4.8) is equivalent to Pog- 
son’s definition. If the magnitudes of two stars 
are m and m + 1 and their flux densities F m 
and F m+ \ , respectively, we have 


m - (m + 1) = -2.5 lg ^ + 2.5 lg F ^±l L 
Fq F 0 


-2.5 lg 


F m 
Fm +1 


whence 

-^- = 4 / Too. 

Fm +1 

In the same way we can show that the magnitudes 
mi and m 2 of two stars and the corresponding 
flux densities F\ and Fi are related by 

F\ 

m\ — m2 — — 2.5 lg—. (4.9) 

Fi 

Magnitudes extend both ways from the orig¬ 
inal six values. The magnitude of the brightest 
star, Sirius, is in fact negative —1.5. The magni¬ 
tude of the Sun is —26.8 and that of a full moon 
—12.5. The magnitude of the faintest objects ob¬ 
served depends on the size of the telescope, the 
sensitivity of the detector and the exposure time. 
The limit keeps being pushed towards fainter ob¬ 
jects; currently the magnitudes of the faintest ob¬ 
served objects are over 30. 


4.3 Magnitude Systems 

The apparent magnitude m , which we have just 
defined, depends on the instrument we use to 
measure it. The sensitivity of the detector is dif¬ 
ferent at different wavelengths. Also, different 
instruments detect different wavelength ranges. 
Thus the flux measured by the instrument equals 
not the total flux, but only a fraction of it. De¬ 
pending on the method of observation, we can de¬ 
fine various magnitude systems. Different mag¬ 
nitudes have different zero points, i.e. they have 
different flux densities Fo corresponding to the 
magnitude 0. The zero points are usually defined 
by a few selected standard stars. 

In daylight the human eye is most sensitive to 
radiation with a wavelength of about 550 nm, the 
sensitivity decreasing towards red (longer wave¬ 
lengths) and violet (shorter wavelengths). The 
magnitude corresponding to the sensitivity of the 
eye is called the visual magnitude m v . 

Photographic plates are usually most sensitive 
at blue and violet wavelengths, but they are also 
able to register radiation not visible to the hu¬ 
man eye. Thus the photographic magnitude m pg 
usually differs from the visual magnitude. The 
sensitivity of the eye can be simulated by using 
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a yellow filter and plates sensitised to yellow and 
green light. Magnitudes thus observed are called 
photovisual magnitudes w pv . 

If, in ideal case, we were able to measure the 
radiation at all wavelengths, we would get the 
bolometric magnitude fflboi- In practice this is 
very difficult, since part of the radiation is ab¬ 
sorbed by the atmosphere; also, different wave¬ 
lengths require different detectors. (In fact there 
is a gadget called the bolometer, which, however, 
is not a real bolometer but an infrared detector.) 
The bolometric magnitude can be derived from 
the visual magnitude if we know the bolometric 
correction BC: 

mboi = m v — BC. (4.10) 

By definition, the bolometric correction is zero 
for radiation of solar type stars (or, more pre¬ 
cisely, stars of the spectral class F5). Although the 
visual and bolometric magnitudes can be equal, 
the flux density corresponding to the bolometric 
magnitude must always be higher. The reason of 
this apparent contradiction is in the different val¬ 
ues of Fq. 

The more the radiation distribution differs 
from that of the Sun, the higher the bolometric 
correction is. The correction is positive for stars 
both cooler or hotter than the Sun. Sometimes 
the correction is defined as Wbol = m v + BC in 
which case BC < 0 always. The chance for er¬ 
rors is, however, very small, since we must have 
w bo i <m v . 

The most accurate magnitude measurements 
are made using photoelectric photometers or 
CCD cameras. Usually filters are used to allow 
only a certain wavelength band to enter the de¬ 
tector. One of the multicolour magnitude sys¬ 
tems used widely in photoelectric photometry is 
the UBV system developed in the early 1950’s 
by Harold L. Johnson and William W. Morgan. 
Magnitudes are measured through three filters, 
U = ultraviolet, B = blue and V = visual. Fig¬ 
ure 4.6 and Table 4.1 give the wavelength bands 
of these filters. The magnitudes observed through 
these filters are called U, B and V magnitudes, 
respectively. 

The UBV system was later augmented by 
adding more bands. One commonly used system 



Fig. 4.6 Relative transmission profiles of filters used in 
the UBVRI magnitude system. The maxima of the bands 
are normalised to unity. The R and I bands are based on 
the system of Johnson, Cousins and Glass, which includes 
also infrared bands J, H, K, L and M. Previously used R 
and I bands differ considerably from these. The curves of 
the ugriz magnitudes (dashed lines) give quantum efficien¬ 
cies. They include the atmospheric extinction for airmass 
1.3 (Sect. 4.5) 


Table 4.1 Wavelength bands of the UBVRI and uvby fil¬ 
ters and their effective average) wavelengths 



Magnitude 

Band 

width [nm] 

Effective 
wavelength [nm] 

u 

ultraviolet 

66 

367 

B 

blue 

94 

436 

V 

visual 

88 

545 

R 

red 

138 

638 

I 

infrared 

149 

797 

u 

ultraviolet 

30 

349 

V 

violet 

19 

411 

b 

blue 

18 

467 

y 

yellow 

23 

547 


is the five colour UBVRI system, which includes 
R = red and I = infrared filters. 

There are also other broad band systems, but 
they are not as well standardised as the UBV, 
which has been defined moderately well using 
a great number of standard stars all over the sky. 
The magnitude of an object is obtained by com¬ 
paring it to the magnitudes of standard stars. 

In Strdmgren ’s four-colour or uvby system , the 
bands passed by the filters are much narrower 
than in the UBV system. The uvby system is also 
well standardised, but it is not quite as common 
as the UBV. Other narrow band systems exist as 
well. By adding more filters, more information on 
the radiation distribution can be obtained. 
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In any multicolour system, we can define 
colour indices ; a colour index is the difference 
of two magnitudes. By subtracting the B magni¬ 
tude from U we get the colour index U — B, and 
so on. If the UBV system is used, it is common to 
give only the V magnitude and the colour indices 
U — B and B — V. 

The constants Fq in (4.8) for U, B and V mag¬ 
nitudes have been selected in such a way that the 
colour indices B — V and U — B are zero for 
stars of spectral type AO (for spectral types, see 
Chap. 8). The surface temperature of such a star is 
about 10,000 K. For example, Vega (a Lyr, spec¬ 
tral class AOV) has V = 0.04, B — V — U — B = 
0.00. The Sun has V — —26.8, B — V — 0.64 and 
U-B = 0.12. 

Before the UBV system was developed, a 
colour index C.I., defined as C.I. = ;n pg — m v 
was used. The definition shows that C.I. corre¬ 
sponds to the colour index B — V. In fact, C.I. = 
(B — V) — 0.11. 

Nowadays it is becoming more customary to 
use the AB system (AB solute), in which Fo is 
the same, 3631 Jy, for all wavelength bands. For 
instance the ugriz magnitudes used by the Sloan 
Digital Sky Survey (SDSS) are based on this sys¬ 
tem. There are several different transformation 
equations between the UBV and ugriz systems 
for different kinds of objects. For ordinary stars, 
we can use the following: 

V = g- 0.2906 (u - g) + 0.0885, 



Fig. 4.7 The apparent magnitude at a distance r depends 
on the flux density F(r). The absolute magnitude is de¬ 
fined as the apparent magnitude at a distance of 10 parsecs 
from the star depending only on the flux density F(T0) 10 
parsecs away from the star 

the apparent magnitude at a distance of 10 par¬ 
secs from the star (Fig. 4.7). Officially this defi¬ 
nition was accepted in the general meeting of the 
IAU in 1922. 

We shall now derive an equation which relates 
the apparent magnitude m, the absolute magni¬ 
tude M and the distance r. Because the flux em¬ 
anating from a star into a solid angle co has, at 
a distance r, spread over an area cor 2 , the flux 
density is inversely proportional to the distance 
squared. Therefore the ratio of the flux density at 
a distance r. Fir), to the flux density at a distance 
of 10 parsecs, F(10), is 


V = g- 0.5784 (g-r)- 0.0038, 
R — r — 0.1837 (g - r) - 0.0971, 
R = r - 0.2936 (r-i)-0. 1439, 
I = r — 1.2444 (r-i)- 0.3820, 
I = i- 0.3780 (,i -z) - 0.3974. 


(4.11) 


F(r) /10pc 

F(10) ~~ V r 


Thus the difference of magnitudes at r and 10 pc, 
or the distance modulus m — M, is 


m — M = —2.51g 


Fir) 

F(10) 


-2.5 lg 



2 


4.4 Absolute Magnitudes 

Thus far we have discussed only apparent mag¬ 
nitudes. They do not tell us anything about the 
true brightness of stars, since the distances differ. 
A quantity measuring the intrinsic brightness of 
a star is the absolute magnitude. It is defined as 


or 

m-M = 5 lg —^—. (4.12) 

10 pc 

For historical reasons, this equation is almost al¬ 
ways written as 

m — M — 5 lgr — 5, 


(4.13) 
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Fig. 4.8 The interstellar 
medium absorbs and 
scatters radiation; this 
usually reduces the energy 
flux L in the solid angle co 
(d L < 0) 


dr 



which is valid only if the distance is expressed in 
parsecs. (The logarithm of a dimensional quan¬ 
tity is, in fact, physically absurd.) Sometimes the 
distance is given in kiloparsecs or megaparsecs, 
which require different constant terms in (4.13). 
To avoid confusion, we highly recommend the 
form (4.12). 

Absolute magnitudes are usually denoted by 
capital letters. Note, however, that the U, B and 
V magnitudes are apparent magnitudes. The cor¬ 
responding absolute magnitudes are Mu, Mb and 
My. 

The absolute bolometric magnitude can be ex¬ 
pressed in terms of the luminosity. Let the total 
flux density at a distance r — 10 pc be F and let 
Fq be the equivalent quantity for the Sun. Since 
the luminosity is L = 4itr 2 F, we get 


(4.12) no longer holds, because part of the ra¬ 
diation is absorbed by the medium (and usually 
re-emitted at a different wavelength, which may 
be outside the band defining the magnitude), or 
scattered away from the line of sight. All these 
radiation losses are called the extinction. 

Now we want to find out how the extinction 
depends on the distance. Assume we have a star 
radiating a flux Lq into a solid angle co in some 
wavelength range. Since the medium absorbs and 
scatters radiation, the flux L will now decrease 
with increasing distance r (Fig. 4.8). In a short 
distance interval (r, r + dr], the extinction dL is 
proportional to the flux L and the distance trav¬ 
elled in the medium: 

dL = —aL dr. (4.15) 


or 


Mbol - Afbol.G = -2.5 lg 


F 


= —2-5 lg 


L/Anr 2 
L©/47rr 2 ’ 


A*bol - ZWbol.o = -2.5 lg 



(4.14) 


The factor a tells how effectively the medium can 
obscure radiation. It is called the opacity. From 
(4.15) we see that its dimension is [a] = m . 
The opacity is zero for a perfect vacuum and ap¬ 
proaches infinity when the substance becomes re¬ 
ally murky. We can now define a dimensionless 
quantity, the optical thickness r by 

dr = a dr. (4.16) 


The absolute bolometric magnitude Mb 0 ] = 0 cor- Substituting this into (4. 15) we get 
responds to a luminosity Lq = 3.0 x 10 28 W. 

dL = — L dr. 


4.5 Extinction and Optical 
Thickness 

Equation (4.12) shows how the apparent magni¬ 
tude increases (and brightness decreases!) with 
increasing distance. If the space between the ra¬ 
diation source and the observer is not completely 
empty, but contains some interstellar medium, 


Next we integrate this from the source (where 
L = Lq and r = 0) to the observer: 

r-=-/v 

J Lq L Jq 

which gives 


L = Lq£~ T . 


(4.17) 
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Here, r is the optical thickness of the material be¬ 
tween the source and the observer and L, the ob¬ 
served flux. Now, the flux L falls off exponen¬ 
tially with increasing optical thickness. Empty 
space is perfectly transparent, i.e. its opacity is 
or = 0; thus the optical thickness does not in¬ 
crease in empty space, and the flux remains con¬ 
stant. 

Let Fq be the flux density on the surface of 
a star and F{r), the flux density at a distance r. 
We can express the fluxes as 

L = a>r 2 F(r), L 0 = coR 2 F 0 , 

where R is the radius of the star. Substitution into 
(4.16) gives 


j^2 

F(r) = F 0 — e“ T . 
r- 

For the absolute magnitude we need the flux den¬ 
sity at a distance of 10 parsecs, F{ 10), which is 
still evaluated without extinction: 


F( 10) = F 0 


R 2 

(10 pc) 2 


The distance modulus m — M is now 


or 


m — M = —2.5 lg 


F(r) 

FU0) 


= 51g Typ-2.51ge _T 

10 pc 


= 51g-^ + (2.51ge)r 
10 pc 


where the constant a = 2.5or Ige gives the extinc¬ 
tion in magnitudes per unit distance. 

Colour Excess Another effect caused by the in¬ 
terstellar medium is the reddening of light : blue 
light is scattered and absorbed more than red. 
Therefore the colour index B — V increases. The 
visual magnitude of a star is, from (4. 1 8), 

V = My + 5 lg -J— + Ay, (4.20) 
10 pc 

where My is the absolute visual magnitude and 
Ay is the extinction in the V passband. Similarly, 
we get for the blue magnitudes 

B — Mb + 5 lg —-h Ab- 

10 pc 

The observed colour index is now 


B — V = Mg — My + Ab — Ay, 


or 


B — V = (B — V) 0 + E B -v. (4.21) 


where (B — V)o — Mb — My is the intrinsic 
colour of the star and £b-v = (B — V) — (B — 
V)o is the colour excess. Studies of the interstel¬ 
lar medium show that the ratio of the visual ex¬ 
tinction Ay to the colour excess £b-v is almost 
constant for all stars: 


-Eb-v 


This makes it possible to find the visual extinction 
if the colour excess is known: 


m M = 5 lg - -hA, (4.18) 

10 pc 


A v ^3.0£ B -v- (4.22) 


where A > 0 is the extinction in magnitudes due 
to the entire medium between the star and the ob¬ 
server. If the opacity is constant along the line of 
sight, we have 


When Ay is obtained, the distance can be solved 
directly from (4.19), when V and My are known. 

We shall study interstellar extinction in more 
detail in Sect. 15.1 (“Interstellar Dust”). 


r = a I dr = ar, 

Jo 

and (4.18) becomes 

m — M = 5 lg-h ar, (4.19) 

10 pc 


Atmospheric Extinction As we mentioned in 
Sect. 3.1, the Earth’s atmosphere also causes ex¬ 
tinction. The observed magnitude m depends on 
the location of the observer and the zenith dis¬ 
tance of the object, since these factors deter¬ 
mine the distance the light has to travel in the 
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Fig. 4.9 If the zenith distance of a star is z, the light of 
the star travels a distance H/ cos z in the atmosphere; H is 
the height of the atmosphere 

atmosphere. To compare different observations, 
we must first reduce them, i.e. remove the at¬ 
mospheric effects somehow. The magnitude ;«o 
thus obtained can then be compared with other 
observations. 

If the zenith distance z is not too large, we can 
approximate the atmosphere by a plane layer of 
constant thickness (Fig. 4.9). If the thickness of 
the atmosphere is used as a unit, the light must 
travel a distance 

X — 1/ cos z = sec z (4.23) 

in the atmosphere. The quantity X is the air mass. 
According to (4.18), the magnitude increases lin¬ 
early with the distance X: 

m — mo + kX, (4.24) 

where k is the extinction coefficient. 


The extinction coefficient can be determined 
by observing the same source several times dur¬ 
ing a night with as wide a zenith distance range 
as possible. The observed magnitudes are plot¬ 
ted in a diagram as a function of the air mass X. 
The points lie on a straight line the slope of 
which gives the extinction coefficient k. When 
this line is extrapolated to X — 0, we get the mag¬ 
nitude mo, which is the apparent magnitude out¬ 
side the atmosphere. 

In practice, observations with zenith distances 
higher than 70° (or altitudes less than 20°) are 
not used to determine k and mo, since at low al¬ 
titudes the curvature of the atmosphere begins to 
complicate matters. The value of the extinction 
coefficient k depends on the observation site and 
time and also on the wavelength, since extinction 
increases strongly towards short wavelengths. 


4.6 Examples 

Example 4.1 Show that intensity is independent 
of distance. 

Suppose we have some radiation leaving the 
surface element dA in the direction 6. The energy 
entering the solid angle d w in time dr is 

d E = I cos# dA dcodr, 

where I is the intensity. If we have another sur¬ 
face dA' at a distance r receiving this radiation 
from direction 9', we have 

Aw — AA' cos9'/r 2 . 



The definition of the intensity gives 


AE = I' cos 9' AA'dw' dr, 


where /' is the intensity at dA' and 
Aw' = AA cos 9/r 2 . 
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Substitution of Aco and do/ into the expressions 
of AE gives 


d A' cos 9' 

I cos 9 d 9 dA--- At 


, , ,AA cos 9 , 

= I cos9 AA --—df I' = I. 


The solid angle subtended by the Sun is 

&> = 7r^—^ = tt x 0.00465 2 

= 6.81 x 10 -5 sterad, 
and the surface brightness 


Thus the intensity remains constant in empty 
space. 

Example 4.2 (Surface Brightness of the Sun) 
Assume that the Sun radiates isotropically. Let R 
be the radius of the Sun, Fq the flux density on 
the surface of the Sun and F the flux density at 
a distance r. Since the luminosity is constant, 

L =4ttR 1 2 F q = 4ttr 2 F, 


C 

B= — =2.01 x 10 7 Wm“ 2 sterad -1 . 
to 

Example 4.3 (Magnitude of a Binary Star) 
Since magnitudes are logarithmic quantities, they 
can be a little awkward for some purposes. For 
example, we cannot add magnitudes like flux 
densities. If the magnitudes of the components 
of a binary star are 1 and 2, the total magnitude 
is certainly not 3. To find the total magnitude, we 
must first solve the flux densities from 


the flux density equals 

R 2 

F = F e~. 

yZ 

At a distance r y>> R, the Sun subtends a solid 
angle 


F\ 

l = -2.51g-i, 
Fo 

which give 


F 1 = F 0 x lO -0 ' 4 , 


2 = 


-2.5 lg 


Fi 

Fo’ 


F 2 = F 0 x lO -0 8 . 


Thus the total flux density is 


A 


n R 1 
r 2 


where A — ttR 2 is the cross section of the Sun. 
The surface brightness B is 



CO Tt 


F = + F 2 = F 0 (lO -0 ' 4 + 10 -0 ' 8 ) 

and the total magnitude, 

E 0 (10 -0 ' 4 + 10 -08 ) 


m — —2.5 lg ■ 


Fo 


=-2.5 lg0.5566 = 0.64. 


Applying (4.4) we get 


B = I q . 


Thus the surface brightness is independent of dis¬ 
tance and equals the intensity. We have found 
a simple interpretation for the somewhat abstract 
concept of intensity. 

The flux density of the Sun on the Earth, the 
solar constant, is Sq 1370 Wm -2 . The angular 
diameter of the Sun is a = 32', whence 


R a 

7 ~ 2 


1 32 n 

- x — x -= 0.00465 rad. 

2 60 180 


Example 4.4 The distance of a star is r — 
100 pc and its apparent magnitude m = 6. What 
is its absolute magnitude? 

Substitution into (4.12) 

m — M = 5 lg — 

10 pc 

gives 

100 

M = 6 — 5 lg-= 1. 

6 10 

Example 4.5 The absolute magnitude of a star 
is M — —2 and the apparent magnitude m = 8. 
What is the distance of the star? 
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We can solve the distance r from (4.12): 

r = 10 pc x 10 (m “ M)/5 = 10 x 10 10/5 pc 
= 1000 pc = 1 kpc. 

Example 4.6 Although the amount of interstel¬ 
lar extinction varies considerably from place to 
place, we can use an average value of 2 mag/kpc 
near the galactic plane. Find the distance of the 
star in Example 4.5, assuming such extinction. 
Now the distance must be solved from (4.19): 

8-(-2) = 51g^+0.002r, 

where r is in parsecs. This equation cannot be 
solved analytically, but we can always use a nu¬ 
merical method. We try a simple iteration (Ap¬ 
pendix A. 7), rewriting the equation as 

r = 10 x lO 2 - 0 0004 '-. 

The value r — 1000 pc found previously is a good 
initial guess: 

r 0 = 1000, 

n = 10 x io 2 “ 0 0004 x 1000 = 398, 
r 2 = 693, 

r \2 — n 3 = 584. 

The distance is r ^ 580 pc, which is much less 
than our earlier value 1000 pc. This should be 
quite obvious, since due to extinction, radiation 
is now reduced much faster than in empty space. 

Example 4.7 What is the optical thickness of 
a layer of fog, if the Sun seen through the fog 
seems as bright as a full moon in a cloudless sky? 

The apparent magnitudes of the Sun and the 
Moon are —26.8 and —12.5, respectively. Thus 
the total extinction in the cloud must be A = 14.3. 
Since 

A = (2.51ge)r, 

we get 

r = A/(2.51ge) = 14.3/1.086= 13.2. 


The optical thickness of the fog is 13.2. In reality, 
a fraction of the light scatters several times, and 
a few of the multiply scattered photons leave the 
cloud along the line of sight, reducing the total 
extinction. Therefore the optical thickness must 
be slightly higher than our value. 

Example 4.8 (Reduction of Observations) The 
altitude and magnitude of a star were measured 
several times during a night. The results are given 
in the following table. 


Altitude 

Zenith 

distance 

Air 

mass 

Magnitude 

50° 

40° 

1.31 

0.90 

35° 

55° 

1.74 

0.98 

25° 

65° 

2.37 

1.07 

20° 

70° 

2.92 

1.17 


By plotting the observations as in the follow¬ 
ing figure, we can determine the extinction coef¬ 
ficient k and the magnitude iiiq outside the atmo¬ 
sphere. This can be done graphically (as here) or 
using a least-squares fit. 



Extrapolation to the air mass X = 0 gives 
;«o = 0.68. The slope of the line gives k — 0.17. 


4.7 Exercises 

Exercise 4.1 The total magnitude of a triple star 
is 0.0. Two of its components have magnitudes 
1.0 and 2.0. What is the magnitude of the third 
component? 

Exercise 4.2 The absolute magnitude of a star 
in the Andromeda galaxy (distance 690 kpc) is 
M = 5. It explodes as a supernova, becoming one 
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billion (10 9 ) times brighter. What is its apparent 
magnitude? 

Exercise 4.3 Assume that all stars have the 
same absolute magnitude and stars are evenly 
distributed in space. Let N(m) be the number of 
stars brighter than m magnitudes. Find the ratio 
N(m + 1 )/N(m). 

Exercise 4.4 The V magnitude of a star is 15.1, 
B — V — 1.6, and absolute magnitude My = 1.3. 
The extinction in the direction of the star in the 


visual band is fly = 1 mag kpc 1 . What is the in¬ 
trinsic colour of the star? 

Exercise 4.5 Stars are observed through a triple 
window. Each surface reflects away 15 % of the 
incident light. 

(a) What is the magnitude of Regulus (My = 
1.36) seen through the window? 

(b) What is the optical thickness of the window? 
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In the previous chapters we have studied the 
physical properties and detection of electromag¬ 
netic radiation. Next we shall briefly discuss con¬ 
cepts related to emission and absorption of radi¬ 
ation. Since we can give here only a summary of 
some essential results without delving into quan¬ 
tum mechanical explanations, the reader inter¬ 
ested in the details is advised to consult any good 
physics textbook. 


5.1 Radiation of Atoms and 
Molecules 

Electromagnetic radiation is emitted or absorbed 
when an atom or a molecule moves from one en¬ 
ergy level to another. If the energy of the atom 
decreases by an amount A E, the atom emits or 
radiates a quantum of electromagnetic radiation, 
called a photon , whose frequency v is given by 
the equation 

AE = hv, (5.1) 

where h is the Planck constant, h = 6.6256 x 
10~ 34 Js. Similarly, if the atom receives or ab¬ 
sorbs a photon of a frequency v, its energy in¬ 
creases by A E — hv. 

The classical model describes an atom as 
a nucleus surrounded by a swarm of electrons. 
The nucleus consists of Z protons, each hav¬ 
ing a charge +e and N electrically neutral neu¬ 
trons; Z is the charge number of the atom and 
A — Z + IV is its mass number. A neutral atom 
has as many electrons (charge —e) as protons. 


An energy level of an atom usually refers to 
an energy level of its electrons. The energy E of 
an electron cannot take arbitrary values; only cer¬ 
tain energies are allowed: the energy levels are 
quantised. An atom can emit or absorb radiation 
only at certain frequencies u;f corresponding to 
energy differences between some initial and final 
states i and /: | E\ — Ef\ = hv if. This gives rise 
to the line spectrum, specific for each element 
(Fig. 5.1). Hot gas under low pressure produces 
an emission spectrum consisting of such discrete 
lines. If the same gas is cooled down and ob¬ 
served against a source of white light (which has 
a continuous spectrum), the same lines are seen 
as dark absorption lines. 

At low temperatures most atoms are in their 
lowest energy state, the ground state. Higher en¬ 
ergy levels are excitation states; a transition from 
lower to higher state is called excitation. Usually 
the excited atom will return to the lower state very 
rapidly, radiating a photon ( spontaneous emis¬ 
sion); a typical lifetime of an excited state might 
be 10 -8 seconds. The frequency of the emitted 
photon is given by (5.1). The atom may return to 
the lower state directly or through some interme¬ 
diate states, emitting one photon in each transi¬ 
tion. 

Downward transitions can also be induced 
by radiation. Suppose our atom has swallowed 
a photon and become excited. Another photon, 
whose frequency v corresponds to some possi¬ 
ble downward transition from the excited state, 
can now irritate the atom, causing it to jump to 
a lower state, emitting a photon with the same 
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Fig. 5.1 Origin of line spectra, (a) Emission spectrum. 
Atoms of glowing gas returning from excited states to 
lower states emit photons with frequencies corresponding 
to the energy difference of the states. Each element emits 
its own characteristic wavelengths, which can be mea- 

frequency v>. This is called induced or stimulated 
emission. Photons emitted spontaneously leave 
the atom randomly in all directions with random 
phases: the radiation is isotropic and incoherent. 
Induced radiation, on the other hand, is coherent, 
it propagates in the same direction as and in phase 
with the inducing radiation. 

Also collisions of atoms cause transitions 
up and down. In these transitions photons are 
not emitted, instead the kinetic energies of the 
atoms are changed. (An upward transition can, of 
course, cause e.g. a spontaneous emission.) The 
higher the density of the gas the more dominant 
the collisional transitions are. 

The zero level of the energy states is usually 
chosen so that a bound electron has negative en¬ 
ergy and a free electron positive energy (cf. the 
energy integral of planetary orbits, Chap. 6). If 
an electron with energy E < 0 receives more en¬ 
ergy than |£|, it will leave the atom, which be¬ 
comes an ion. In astrophysics ionisation is often 
called a bound-free transition (Fig. 5.2). Unlike 


sured by spreading the light into a spectrum with a prism 
or diffraction grating, (b) Absorption spectrum. When 
white light containing all wavelengths travels through gas, 
the wavelengths characteristic of the gas are absorbed 


El 

0 

Excited! 
states j 

Ground 

state 








































Absorbtion Emission Ionization Recombination Free-free 


Fig. 5.2 Different kinds of transitions between energy 
levels. Absorption and emission occur between two bound 
states, whereas ionisation and recombination occur be¬ 
tween a bound and a free state. Interaction of an atom with 
an free electron can result in a free-free transition 

in excitation all values of energy (E > 0) are now 
possible. The extraneous part of the absorbed en¬ 
ergy goes to the kinetic energy of the liberated 
electron. The inverse process, in which an atom 
captures a free electron, is the recombination or 
free-bound transition. 
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Fig. 5.3 Polarisation of light. The light of an incandes¬ 
cent bulb contains all possible directions of vibration and 
is therefore unpolarised. Some crystals, for example, pass 
electric fields oscillating only in certain directions, and the 
transmitted part of the light becomes linearly polarised. 
E is the electric field and B the magnetic field 

When an electron scatters from a nucleus or an 
ion without being captured, the electromagnetic 
interaction can change the kinetic energy of the 
electron producing free-free radiation. In a very 
hot gas (T > 10 6 K) hydrogen is fully ionised, 
and the free-free radiation is the most important 
source of emission. It is then usually called ther¬ 
mal bremsstrahlung. The latter part of the name 
derives from the fact that decelerating electrons 
hitting the anode of an X-ray tube emit similar ra¬ 
diation. In an analogous way the absorption pro¬ 
cess can be called a bound-bound transition. 

Electromagnetic radiation is transverse wave 
motion; the electric and magnetic fields oscillate 
perpendicular to each other and also perpendic¬ 
ular to the direction of propagation. The light of 
an ordinary incandescent lamp has a random dis¬ 
tribution of electric fields vibrating in all direc¬ 
tions. If the directions of electric fields in the 
plane perpendicular to the direction of propaga¬ 
tion are not evenly distributed, the radiation is 
polarised (Fig. 5.3). The direction of polarisation 
of linearly polarised light means the plane deter¬ 
mined by the electric vector and the direction of 
the light ray. If the electric vector describes a cir¬ 
cle, the radiation is circularly polarised. If the 
amplitude of the electric field varies at the same 
time, the polarisation is elliptic. 

If polarised radiation travels through a mag¬ 
netic field, the direction of the polarisation will 
rotate. The amount of such Faraday rotation is 
proportional to the component of the magnetic 
field parallel to the line of sight, number of elec¬ 
trons along the line of sight, distance travelled, 
and square of the wavelength of the radiation. 

Scattering is an absorption followed by an 
instantaneous emission at the same wavelength 


but usually in a new direction. On the macro¬ 
scopic scale, radiation seems to be reflected by 
the medium. The light coming from the sky is 
sunlight scattered from atmospheric molecules. 
Scattered light is always polarised, the degree of 
polarisation being highest in the direction perpen¬ 
dicular to the direction of the original radiation. 


5.2 The Hydrogen Atom 

The hydrogen atom is the simplest atom, consist¬ 
ing of a proton and an electron. According to the 
Bohr model the electron orbits the proton in a cir¬ 
cular orbit. (In spite of the fact that this model has 
very little to do with reality, it can be successfully 
used to predict some properties of the hydrogen 
atom.) Bohr’s first postulate says that the angu¬ 
lar momentum of the electron must be a multiple 
of h: 

invr — nH, (5.2) 

where 

m = mass of the electron, 
v = speed of the electron, 
r = radius of the orbit, 
n = the principal quantum number, 
n= 1,2,3,..., 
h= h/2n, 

h = the Planck constant. 

The quantum mechanical interpretation of 
Bohr’s first postulate is obvious: the electron is 
represented as a standing wave, and the “length 
of the orbit” must be a multiple of the de Broglie 
wavelength, X = h/p — H/mv. 

A charged particle in a circular orbit (and thus 
in accelerated motion) should emit electromag¬ 
netic radiation, losing energy, were it to obey the 
rules of classical electrodynamics. Therefore our 
electron should spiral down towards the nucleus. 
But obviously, Nature does not behave this way, 
and we have to accept Bohr’s second postulate, 
which says that an electron moving in an allowed 
orbit around a nucleus does not radiate. Radia- 
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Fig. 5.4 Transitions of 
a hydrogen atom. The 
tower picture shows a part 
of the spectrum of the star 
HD193182. On both sides 
of the stellar spectrum we 
see an emission spectrum 
of iron. The wavelengths of 
the reference lines are 
known, and they can be 
used to calibrate the 
wavelengths of the 
observed stellar spectrum. 
The hydrogen Balmer lines 
are seen as dark absorption 
lines converging towards 
the Balmer ionisation limit 
(also called the Balmer 
discontinuity) at 
A = 364.7 nm to the left. 
The numbers (15,..., 40) 
refer to the quantum 
number n of the higher 
energy level. (Photo by 
Mt. Wilson Observatory) 



tion is emitted only when the electron jumps from 
a higher energy state to a lower one. The emitted 
quantum has an energy hv, equal to the energy 
difference of these states: 

hv=E, n -E n j. (5.3) 

We shall now try to find the energy of an electron 
in the state E n . Coulomb’s law gives the force 
pulling the electron towards the proton: 


The acceleration of a particle moving in a cir¬ 
cular orbit of radius r„ is 


a 


r 


2 

n 


r n 


and applying Newton’s second law (F = ma ), we 
get 


mv 2 1 e 2 
r n 4jte 0 r 2 ' 


(5.5) 


F = 


1 e 2 
4jre 0 r 2 ’ 


where 


From (5.2) and (5.5) it follows that 

(5.4) 

e 2 1 47T€ofi 2 2 

t in — , . — , T n = ^ fl . 

4jreo nn me¬ 


et) — the vacuum permittivity 
= 8.85 x 10“ 12 N -1 m” 2 C 2 , 
e — the charge of the electron =1.6xlO -19 C, 


The total energy of an electron in the orbit n is 
now 


E n = T + V — -m v~ 


1 e 2 
4ire 0 r n 


me 4 1 

32jr 2 eQh 2 n 2 



r n — the distance between the electron and the 
proton. 


(5.6) 
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where C is a constant. For the ground state (n — 
1), we get from (5.6) 


Ei = -2.18 x 10“ 18 J = -13.6 eV. 


From (5.3) and (5.6) we get the energy of the 
quantum emitted in the transition E„ 2 —*■ E ni : 



In terms of the wavelength k this can be ex¬ 
pressed as 

1 _ u _ C / 1 l\_ 

k c he \/?[ 

(5.8) 

where R is the Rydberg constant , R = 1.097 x 
10 7 m -1 . 

Equation (5.8) was derived experimentally for 
«i = 2 by Johann Jakob Balmer as early as 1885. 
That is why we call the set of lines produced by 
transitions E„ —> Ei the Balmer series. These 
lines are in the visible part of the spectrum. For 
historical reasons the Balmer lines are often de¬ 
noted by symbols H„, H^, H y etc. If the elec¬ 
tron returns to its ground state ( E n —► E\ ), we 
get the Lyman series, which is in the ultravio¬ 
let. The other series with specific names are the 
Paschen series (n i = 3), Bracket series (n i = 4) 
and Pfund series (ni = 5) (see Fig. 5.4). 



5.3 Line Profiles 

The previous discussion suggests that spectral 
lines would be infinitely narrow and sharp. In re¬ 
ality, however, they are somewhat broadened. We 
will now consider briefly the factors affecting the 
shape of a spectral line, called a line profile. An 
exact treatment would take us too deep into quan¬ 
tum mechanics, so we cannot go into the details 
here. 

According to quantum mechanics everything 
cannot be measured accurately at the same time. 
For example, even in principle, there is no way 
to determine the x coordinate and the momen¬ 
tum p x in the direction of the x axis with arbi¬ 
trary precision simultaneously. These quantities 



Fig. 5.5 Each spectral line has its characteristic natural 
width (solid line). Motions of particles broaden the line 
further due to the Doppler effect, resulting in the Voigt 
profile (dashed line). Both profiles have the same area 


have small uncertainties Ax and A p x , such that 
Ax A p x & h. 

A similar relation holds for other directions, too. 
Time and energy are also connected by an uncer¬ 
tainty relation, 


AE At & h. 

The natural width of spectral lines is a conse¬ 
quence of this Heisenberg uncertainty principle. 

If the average lifetime of an excitation state 
is T , the energy corresponding to the transi¬ 
tion can only be determined with an accuracy 
of A E — h/T — h/(2nT). From (5.1) it follows 
that Av = A E/h. In fact, the uncertainty of the 
energy depends on the lifetimes of both the ini¬ 
tial and final states. The natural width of a line is 
defined as 


Y = 


A E{ + A£f 
h 


l 

Ti 


1 

Tf 


(5.9) 


It can be shown that the corresponding line profile 
is 


Y_ _To_ 

2n (v - v 0 ) 2 + y 2 /4' 


(5.10) 
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where vq is the frequency at the centre of the line 
and /o the total intensity of the line. At the centre 
of the line the intensity per frequency unit is 


and at the frequency v — t>o + y /2, 

1 1 

A’o+y/2 — ^0 — xA’o- 

Tty l 

Thus the width y is the width of the line profile 
at a depth where the intensity is half of the max¬ 
imum. This is called the full width at half maxi¬ 
mum (FWHM). 



Fig. 5.6 The full width at half maximum (FWHM) of 
a spectral line is the width at the depth where the inten¬ 
sity is half of the maximum. The equivalent width W is 
defined so that the line and the shaded rectangle have the 
same area in the picture. The two measures are not gen¬ 
erally the same, although they usually are close to each 
other 


Doppler Broadening Atoms of a gas are mov¬ 
ing the faster the higher the temperature of the 
gas. Thus spectral lines arising from individual 
atoms are shifted by the Doppler effect. The ob¬ 
served line consists of a collection of lines with 
different Doppler shifts, and the shape of the line 
depends on the number of atoms with different 
velocities. 

Each Doppler shifted line has its character¬ 
istic natural width. The resulting line profile is 
obtained by giving each Doppler shifted line 
a weight proportional to the number of atoms 
given by the velocity distribution and integrat¬ 
ing over all velocities. This gives rise to the Voigt 
profile (Fig. 5.5), which already describes most 
spectral lines quite well. The shapes of different 
profiles don’t seem very different; the most ob¬ 
vious consequence of the broadening is that the 
maximum depth decreases. 

One way to describe the width of a line is to 
give its full width at half maximum (Fig. 5.6). 
Due to Doppler broadening this is usually greater 
than the natural width. The equivalent width is 
another measure of a line strength. It is the area 
of a rectangular line that has the same area as 
the line profile and that emits no light at all. The 
equivalent width can be used to describe the en¬ 
ergy corresponding to a line independently of the 
shape of the line profile. 


5.4 Quantum Numbers, Selection 
Rules, Population Numbers 

Quantum Numbers The Bohr model needs 
only one quantum number, n, to describe all the 
energy levels of the electron. This can explain 
only the coarse features of an atom with a single 
electron. 

Quantum mechanics describes the electron as 
a three dimensional wave, which only gives the 
probability of Ending the electron in a certain 
place. Quantum mechanics has accurately pre¬ 
dicted all the energy levels of hydrogen atoms. 
The energy levels of heavier atoms and molecules 
can also be computed; however, such calculations 
are very complicated. Also the existence of quan¬ 
tum numbers can be understood from the quan¬ 
tum mechanical point of view. 

The quantum mechanical description involves 
four quantum numbers, one of which is our n, the 
principal quantum number. The principal quan¬ 
tum number describes the quantised energy lev¬ 
els of the electron. The classical interpretation of 
discrete energy levels allows only certain orbits 
given by (5.6). The orbital angular momentum of 
the electron is also quantised. This is described by 
the angular momentum quantum number l. The 
angular momentum corresponding to a quantum 
number / is 


L = y/l(l+l)H. 
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The classical analogy would be to allow some el¬ 
liptic orbits. The quantum number 1 can take only 
the values 

/ = 0, 1,..., n — 1. 

For historical reasons, these are often denoted by 
the letters s, p, d , /, g, h, i, j. 

Although / determines the magnitude of the 
angular momentum, it does not give its direction. 
In a magnetic field this direction is important, 
since the orbiting electron also generates a tiny 
magnetic field. In any experiment, only one com¬ 
ponent of the angular momentum can be mea¬ 
sured at a time. In a given direction z (e.g. in the 
direction of the applied magnetic field), the pro¬ 
jection of the angular momentum can have only 
the values 

L z = miH, 

where m/ is the magnetic quantum number 

mi = 0 , ± 1 , ± 2 ,..., ±/. 

The magnetic quantum number is responsible 
for the splitting of spectral lines in strong mag¬ 
netic fields, known as the Zeeman effect. For ex¬ 
ample, if Z = 1, m; can have 21 + 1 = 3 different 
values. Thus, the line arising from the transition 
/ = 1 —> I — 0 will split into three components in 
a magnetic field (Fig. 5.7). 

The fourth quantum number is the spin de¬ 
scribing the intrinsic angular momentum of the 
electron. The spin of the electron is 

S = -y /s(s + \)h, 

where the spin quantum number is s = In 
a given direction z, the spin is 

S z = m s li , 

where m s can have one of the two values: 



All particles have a spin quantum number. Par¬ 
ticles with an integral spin are called bosons (pho¬ 
ton, mesons); particles with a half-integral spin 


m.) 



Fig. 5.7 The Zeeman effect. In strong magnetic fields 
each energy level of a hydrogen atom splits into (2/ + 1) 
separate levels, which correspond to different values of 
the magnetic quantum number mi =1, l — l,... ,—l. The 
energy differences of the successive levels have the same 
constant value A E. For example the p state (/ = 1) splits 
into three and the d state (/ = 2) into five sublevels. The 
selection rules require that in electric dipole transitions 
A mi equals 0 or ±1, and only nine different transitions 
between p and d states are possible. Moreover, the transi¬ 
tions with the same Am/ have the same energy difference. 
Thus the spectrum has only three separate lines 

are fermions (proton, neutron, electron, neutrino 
etc.). 

Classically, spin can be interpreted as the ro¬ 
tation of a particle; this analogy, however, should 
not be taken too literally. 

The total angular momentum J of an electron 
is the sum of its orbital and spin angular momen¬ 
tum: 

J = L + S. 

Depending on the mutual orientation of the vec¬ 
tors L and S the quantum number j of total an¬ 
gular momentum can have one of two possible 
values, 



(except if / = 0, when j = \). The z component 
of the total angular momentum can have the val¬ 
ues 

mj =0,±1,±2,...± j. 
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Spin also gives rise to the fine structure of 
spectral lines. Lines appear as close pairs or dou¬ 
blets. 

Selection Rules The state of an electron can¬ 
not change arbitrarily; transitions are restricted 
by selection rules, which follow from certain con¬ 
servation laws. The selection rules express how 
the quantum numbers must change in a transi¬ 
tion. Most probable are the electric dipole transi¬ 
tions, which make the atom behave like an oscil¬ 
lating dipole. The conservation laws require that 
in a transition we have 

Al = ±1, 

A mi = 0, ±1. 

In terms of the total angular momentum the se¬ 
lection rules are 

Al — ±1, 

Ay = 0, ±1, 

A m j — 0, ±1. 

The probabilities of all other transitions are 
much smaller, and they are called forbidden tran¬ 
sitions', examples are magnetic dipole transitions 
and all quadrupole and higher multipole transi¬ 
tions. 

Spectral lines originating in forbidden transi¬ 
tions are called forbidden lines. The probability 
of such a transition is so low that under normal 
circumstances, the transition cannot take place 
before collisions force the electron to change 
state. Forbidden lines are possible only if the gas 
is extremely rarefied (like in auroras and plane¬ 
tary nebulae). 

The spins of an electron and nucleus of a hy¬ 
drogen atom can be either parallel or antipar¬ 
allel (Fig. 5.8). The energy of the former state 
is 0.0000059 eV higher. But the selection rules 
make an electric dipole transition between these 
states impossible. The transition, which is a mag¬ 
netic dipole transition, has a very low probabil¬ 
ity, A = 2.8 x 10“ 15 s _l . This means that the av¬ 
erage lifetime of the higher state is T = 1 /A = 
11 x 10 6 years. Usually collisions change the 
state of the electron well before this period of 
time has elapsed. But in interstellar space the den¬ 
sity of hydrogen is so low and the total amount of 



Fig. 5.8 The origin of the hydrogen 21 cm line. The spins 
of the electron and the proton may be either parallel or 
opposite. The energy of the former state is slightly larger. 
The wavelength of a photon corresponding to a transition 
between these states is 21 cm 


hydrogen so great that a considerable number of 
these transitions can take place. 

The wavelength of the radiation emitted by 
this transition is 21 cm, which is in the radio band 
of the spectrum. Extinction at radio wavelengths 
is very small, and we can observe more distant 
objects than by using optical wavelengths. The 
21 cm radiation has been of crucial importance 
for surveys of interstellar hydrogen. 

Population Numbers The population num¬ 
ber tij of an energy state i means the number 
of atoms in that state per unit volume. In ther¬ 
mal equilibrium, the population numbers obey 
the Boltzmann distribution: 

= (511) 
no go 

where T is the temperature, k is the Boltzmann 
constant, AE — E, — Eq — hv is the energy dif¬ 
ference between the excited and ground state, and 
gi is the statistical weight of the level i (it is 
the number of different states with the same en¬ 
ergy £)■). The subscript 0 always refers to the 
ground state. Often the population numbers dif¬ 
fer from the values given by (5.11), but still we 
can define an excitation temperature T exc in such 
a way that (5.11) gives correct population num¬ 
bers, when T is replaced by 7' exc . The excitation 
temperature may be different for different energy 
levels. 
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Fig. 5.9 Spectrum of carbon monoxide CO from 430 nm 
to 670 nm. The various bands correspond to different vi¬ 
brational transitions. Each band is composed of numerous 
rotational lines. Near the right edge of each band the lines 


5.5 Molecular Spectra 

The energy levels of an atom are determined by 
its electrons. In the case of a molecule, there 
are many more possibilities: atoms can vibrate 
around their equilibria and the molecule can ro¬ 
tate around some axis. Both vibrational and rota¬ 
tional states are quantised. Transitions between 
successive vibrational states typically involve 
photons in the infrared band, while transitions be¬ 
tween rotational states involve photons in the mi¬ 
crowave band. These combined with transitions 
of electrons produce a band spectrum, character¬ 
istic for molecules (Fig. 5.9). The spectrum has 
several narrow bands composed of a great num¬ 
ber of lines. 


5.6 Continuous Spectra 

We have already mentioned some processes that 
produce continuous spectra. Continuous emission 
spectra can originate in recombinations and free- 
free transitions. In recombination, an atom cap¬ 
tures a free electron whose energy is not quan¬ 
tised; in free-free transitions, both initial and fi¬ 
nal states are unquantised. Thus the emission line 
can have any frequency whatsoever. Similarly, 
ionisations and free-free transitions can give rise 
to a continuous absorption spectrum. 

Each spectrum contains a continuous compo¬ 
nent, or continuum, and spectral lines. Some¬ 
times. however, the lines are so closely packed 
and so broad that they seem to form a nearly con¬ 
tinuous spectrum. 

When the pressure of hot gas is increased, the 
spectral lines begin to broaden. At high pressure, 
atoms bump into each other more frequently, and 
the close neighbours disturb the energy levels. 


are so closely packed that they overlap and at this res¬ 
olution, the spectrum looks continuous. (R.W.B. Pearse, 
A.G. Gaydon: The Identification of Molecular Spectra 
(Chapman & Hall Ltd., London 1976) p. 394) 

When the pressure is high enough, the lines be¬ 
gin to overlap. Thus the spectrum of hot gas at 
high pressure is continuous. Electric fields also 
broaden spectral lines (the Stark effect). 

In liquids and solids the atoms are more 
densely packed than in gaseous substances. Their 
mutual perturbations broaden the energy levels, 
producing a continuous spectrum. 


5.7 Blackbody Radiation 

A blackbody is defined as an object that does 
not reflect or scatter radiation shining upon it, 
but absorbs and re-emits the radiation completely. 
A blackbody is a kind of an ideal radiator, which 
cannot exist in the real world. Yet many objects 
behave very much as if they were blackbodies. 

The radiation of a blackbody depends only on 
its temperature, being perfectly independent of 
its shape, material and internal constitution. The 
wavelength distribution of the radiation follows 
Planck’s law, which is a function of temperature 
only. The intensity at a frequency v of a black¬ 
body at temperature T is 

2 hv 3 1 

B v (T) = B(v, T) = — eM(m _ i , (5.12) 
where 

h — the Planck constant = 6.63 x 10 -34 J s, 
c = the speed of light «3 x 10 s m s _ 1 , 
k — the Boltzmann constant 
= 1.38 x 1CT 23 JKT 1 . 

By definition of the intensity, the dimension of B v 
is W m“ 2 Hz~ 1 sterad -1 . 
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Blackbody radiation can be produced in a 
closed cavity whose walls absorb all radiation in¬ 
cident upon them (and coming from inside the 
cavity). The walls and the radiation in the cavity 
are in equilibrium; both are at the same temper¬ 
ature, and the walls emit all the energy they re¬ 
ceive. Since radiation energy is constantly trans¬ 
formed into thermal energy of the atoms of the 
walls and back to radiation, the blackbody radia¬ 
tion is also called thermal radiation. 

The spectrum of a blackbody given by Planck’s 
law (5.12) is continuous. This is true if the size of 
the radiator is very large compared with the dom¬ 
inant wavelengths. In the case of the cavity, this 
can be understood by considering the radiation as 
standing waves trapped in the cavity. The num¬ 
ber of different wavelengths is larger, the shorter 
the wavelengths are compared with the size of 
the cavity. We already mentioned that spectra of 
solid bodies are continuous; very often such spec¬ 
tra can be quite well approximated by Planck’s 
law. 

We can also write Planck’s law as a function of 
the wavelength. We require that B v dv = — B, dk. 
The wavelength decreases with increasing fre¬ 
quency; hence the minus sign. Since v = c/X, we 
have 


dv c 
dk ~ ~Xf’ 

whence 


(5.13) 


dv c 

* = ~ B * 


(5.14) 


Let us now try to find the total intensity using 
the first of these integrals: 



B V (T) dv 


2 h f°° v 3 dv 

^ Jo eW(m _ i ■ 


We now change the integration variable to x — 
hv/(kT), whence dv = (kT/h) dx: 


2h k 4 



x 3 dx 
e A — 1 


The definite integral in this expression is just 
a real number, independent of the temperature. 
Thus we find that 


B(T) = AT 4 , (5.16) 


where the constant A has the value 
2 k 4 7r 4 

A = ?pT5' 


(5.17) 


(In order to get the value of A we have to eval¬ 
uate the integral. There is no elementary way to 
do that. We can tell those who are familiar with 
all the exotic functions so beloved by theoreti¬ 
cal physicists, that the integral can rather easily 
be expressed as r (4)f(4), where £ is the Rie- 
mann zeta function and r is the gamma func¬ 
tion. For integral values, r(n) is simply the fac¬ 
torial (n — 1)!. The difficult part is showing that 
C, (A) = jt 4 /90. This can be done by expanding 
x 4 — x 2 as a Fourier-series and evaluating the se¬ 
ries at x = 7T.) 

The flux density F for isotropic radiation of 
intensity B is (Sect. 4.1): 


or 


F —jtB 


Bx ^ T) X 5 e^cKXkT) _ i ’ 

[BJ = Wm _2 m _1 sterad -1 . 


(5.15) 


The functions B v and B, are defined in such 
a way that the total intensity can be obtained in 
the same way using either of them: 


B(T) = 



B v dv 



B\ dL. 


or 

F = crT 4 . (5.18) 

This is the Stefan-Boltzmann law, and the con¬ 
stant o (— jt A) is the Stefan-Boltzmann constant, 

a = 5.67 x 10“ 8 Wnf 2 K -4 . 

From the Stefan-Boltzmann law we get a re¬ 
lation between the luminosity and temperature of 
a star. If the radius of the star is R, its surface 
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area is Arc R 2 , and if the flux density on the sur¬ 
face is F, we have 

L — AjtR 2 F. 

If the star is assumed to radiate like a blackbody, 
we have F — oT 4 , which gives 

L = AnoR 1 T 4 . (5.19) 

In fact this defines the effective temperature of the 
star, discussed in more detail in the next section. 

The luminosity, radius and temperature of 
a star are interdependent quantities, as we can see 
from (5.19). They are also related to the absolute 
bolometric magnitude of the star. Equation (4.13) 
gives the difference of the absolute bolometric 
magnitude of the star and the Sun: 

M bol - M bol , 0 = -2.5 lg (5.20) 

L e 

But we can now use (5.19) to express the lumi¬ 
nosities in terms of the radii and temperatures: 

M bo i-M bo i, G = -51g-^-101g^. (5.21) 

Kq 1q 

As we can see in Fig. 5.10, the wavelength of 
the maximum intensity decreases with increas¬ 
ing total intensity (equal to the area below the 
curve). We can find the wavelength A max corre¬ 
sponding to the maximum intensity by differen¬ 
tiating Planck’s function BxiT ) with respect to X 
and finding zero of the derivative. The result is 
the Wien displacement law: 



Fig. 5.10 Intensity distributions of blackbodies at tem¬ 
perature 12,000 K, 9000 K and 6000 K. Since the ratios 
of the temperatures are 4:3:2, the wavelengths of inten¬ 
sity maxima given by the Wien displacement law are in 
the proportions 1:4, 1:3 and 1:2, or 3, 4 and 6. The ac¬ 
tual wavelengths of the maxima are 241.5 nm, 322 nm and 
483 nm. The total intensities or the areas below the curves 
are proportional to 4 4 , 3 4 and 2 4 

approximated by simpler expressions. When X ~ 
A.max (or hc/iXkT) 1), we have 

e hc/(kkT) ^ I 

In this case we get the Wien approximation 

B\(T) «= 2 h!_ e -hcl(kkT )_ (5.23) 

X-’ 

When hc/{XkT ) 1 (X A max ), we have 


X m . dx T = b = const, (5.22) 


e hc/kkT ^ i + hc /(x kT ), 


where the Wien displacement constant b is 
b — 0.0028978 Km. 

We can use the same procedure to find the 
maximum of B v . But the frequency v max thus ob¬ 
tained is different from v max = c/A. max given by 
(5.22). The reason for this is the fact that the 
intensities are given per unit frequency or unit 
wavelength, and the dependence of frequency on 
wavelength is nonlinear. 

When the wavelength is near the maximum or 
much longer than A max Planck’s function can be 


which gives the Rayleigh-Jeans approximation 


Bx(T ) 


2 he 2 XkT 2 ckT 

X 4 


(5.24) 


This is particularly useful in radio astronomy. 

Classical physics predicted only the Rayleigh- 
Jeans approximation. Were (5.24) true for all 
wavelengths, the intensity would grow beyond 
all limits when the wavelength approaches zero, 
contrary to observations. This contradiction was 
known as the ultraviolet catastrophe. 
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5.8 Temperatures 

Temperatures of astronomical objects range from 
almost absolute zero to millions of degrees. Tem¬ 
perature can be defined in a variety of ways, and 
its numerical value depends on the specific def¬ 
inition used. All these different temperatures are 
needed to describe different physical phenomena, 
and often there is no unique ‘true’ temperature. 

Often the temperature is determined by com¬ 
paring the object, a star for instance, with a black- 
body. Although real stars do not radiate exactly 
like blackbodies, their spectra can usually be ap¬ 
proximated by blackbody spectra after the effect 
of spectral lines has been eliminated. The result¬ 
ing temperature depends on the exact criterion 
used to fit Planck’s function to observations. 

The most important quantity describing the 
surface temperature of a star is the effective tem¬ 
perature T e . It is defined as the temperature of 
a blackbody which radiates with the same total 
flux density as the star. Since the effective temper¬ 
ature depends only on the total radiation power 
integrated over all frequencies, it is well defined 
for all energy distributions even if they deviate far 
from Planck’s law. 

In the previous section we derived the Stefan- 
Boltzmann law, which gives the total flux den¬ 
sity as a function of the temperature. If we now 
find a value T e of the temperature such that the 
Stefan-Boltzmann law gives the correct flux den¬ 
sity F on the surface of the star, we have found 
the effective temperature. The flux density on the 
surface is 

T = CTT e 4 . (5.25) 

The total flux is L = 4nR 2 F, where R is the ra¬ 
dius of the star, and the flux density at a distance r 
is 



where a = 2R/r is the observed angular diame¬ 
ter of the star. For direct determination of the ef¬ 
fective temperature, we have to measure the total 
flux density and the angular diameter of the star. 
This is possible only in the few cases in which the 
diameter has been found by interferometry. 


If we assume that at some wavelength X the 
flux density F k on the surface of the star is ob¬ 
tained from Planck’s law, we get the brightness 
temperature T b . In the isotropic case we have then 
Fx = Jt B k (T h ). If the radius of the star is R and 
distance from the Earth r, the observed flux den¬ 
sity is 



Again Fx can be determined only if the angu¬ 
lar diameter a is known. The brightness tempera¬ 
ture 7b can then be solved from 

F[=(^\ nB x (T b ). (5.27) 

Since the star does not radiate like a blackbody, 
its brightness temperature depends on the partic¬ 
ular wavelength used in (5.27). 

In radio astronomy, brightness temperature is 
used to express the intensity (or surface bright¬ 
ness) of the source. If the intensity at frequency v 
is /„, the brightness temperature is obtained from 

I v = B v (T b ). 


7j, gives the temperature of a blackbody with the 
same surface brightness as the observed source. 

Since radio wavelengths are very long, the 
condition hv <-£ kT of the Rayleigh-Jeans ap¬ 
proximation is usually satisfied (except for mil¬ 
limetre and submillimetre bands), and we can 
write Planck’s law as 

o ^_2/iv 3 1 

B ^ T b) ~ c 2 e l, v /(kT b ) _ l 

_ 2hu 3 1 

l + hv/(kT b ) + -1 



Thus we get the following expression for the ra¬ 
dio astronomical brightness temperature: 


T b = 




(5.28) 


A measure of the signal registered by a radio 
telescope is the antenna temperature T.\. After 
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Fig. 5.11 Determination of the colour temperature. The 
ratio of the flux densities at wavelengths A.i and X 2 gives 
the temperature of a blackbody with the same ratio. In gen¬ 
eral the result depends on the wavelengths chosen 


the antenna temperature is measured, we get the 
brightness temperature from 


7a = ??7b, (5.29) 

where i] is the beam efficiency of the antenna 
(typically 0.4 < r] < 0.8). Equation (5.29) holds 
if the source is wide enough to cover the whole 
beam, i.e. the solid angle £2 A from which the an¬ 
tenna receives radiation. If the solid angle sub¬ 
tended by the source, £2$, is smaller than £2 A , the 
observed antenna temperature is 

T A = r,^T h (£2 S < £2 a). (5.30) 

i2 A 

The colour temperature T c can be determined 
even if the angular diameter of the source is un¬ 
known (Fig. 5.11). We only have to know the 
relative energy distribution in some wavelength 
range [A 1 , ; the absolute value of the flux is not 
needed. The observed flux density as a function 
of wavelength is compared with Planck’s func¬ 
tion at different temperatures. The temperature 
giving the best fit is the colour temperature in the 
interval [A-i, /o]. The colour temperature is usu¬ 
ally different for different wavelength intervals, 
since the shape of the observed energy distribu¬ 
tion may be quite different from the blackbody 
spectrum. 

A simple method for finding a colour tem¬ 
perature is the following. We measure the flux 
density F' } at two wavelengths X\ and A. 2 . If 
we assume that the intensity distribution follows 
Planck’s law, the ratio of these flux densities must 


be the same as the ratio obtained from Planck’s 
law: 

F L B x , (T) A.5 e hc Hte kT ) - 1 

= = x <531) 

The temperature T solved from this equation is 
a colour temperature. 

The observed flux densities correspond to cer¬ 
tain magnitudes mx t and m\ 2 . The definition of 
magnitudes gives 


mx 1 - mx 2 = -2.5 lg -^ + const, 

F x 2 

where the constant term is a consequence of the 
different zero points of the magnitude scales. If 
the temperature is not too high, we can use the 
Wien approximation in the optical part of the 
spectrum: 


Bx, 

m Xi - mx 2 = -2.5 lg — + const 

Bx 2 



he £ 1 1 \ 

+ 2 ' 5 tTU'W lge + C ° nSt ' 

This can be written as 


mx 1 - mx 2 = a + b/T c , (5.32) 

where a and b are constants. This shows that 
there is a simple relationship between the differ¬ 
ence of two magnitudes and the colour tempera¬ 
ture. 

Strictly speaking, the magnitudes in (5.32) are 
monochromatic, but the same relation can be also 
used with broadband magnitudes like B and V. In 
that case, the two wavelengths are essentially the 
effective wavelengths of the B and V bands. The 
constant is chosen so that B — V — 0 for stars 
of the spectral type AO (see Chap. 9). Thus the 
colour index B — V also gives a colour tempera¬ 
ture. 

The kinetic temperature 1\, is related to the 
average speed of gas molecules. The kinetic en¬ 
ergy of an ideal gas molecule as a function of 
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temperature follows from the kinetic gas theory: 


Kinetic energy = —m ir 



Solving for 7k we get 


7i< = 


mv 2 

~W' 


(5.33) 


where m is the mass of the molecule, v its average 
velocity (or rather its r.m.s velocity, which means 
that v 2 is the average of the squared velocities), 
and k, the Boltzmann constant. For ideal gases 
the pressure is directly proportional to the kinetic 
temperature (cf. Box 11.1): 



P = nkT k, (5.34) 

where n is the number density of the molecules 
(molecules per unit volume). We previously de¬ 
fined the excitation temperature T exc as a tem¬ 
perature which, if substituted into the Boltzmann 
distribution (5.1 1), gives the observed population 
numbers. If the distribution of atoms in different 
levels is a result of mutual collisions of the atoms 
only, the excitation temperature equals the kinetic 
temperature, T exc = 7k- 

The ionisation temperature 7, is found by 
comparing the number of atoms in different states 
of ionisation. Since stars are not exactly black- 
bodies, the values of excitation and ionisation 
temperatures usually vary, depending on the el¬ 
ement whose spectral lines were used for temper¬ 
ature determination. 

In thermodynamic equilibrium all these vari¬ 
ous temperatures are equal. 


5.9 Other Radiation Mechanisms 

The radiation of a gas in thermodynamic equi¬ 
librium depends on the temperature and den¬ 
sity only. In astrophysical objects deviations from 
thermodynamic equilibrium are, however, quite 
common. Some examples of non-thermal radia¬ 
tion arising under such conditions are mentioned 
in the following. 

Maser and Laser (See Fig. 5.12.) The Boltz¬ 
mann distribution (5.11) shows that usually there 


Fig. 5.12 The operational principle of the maser and the 
laser. A metastable state (a state with a relatively long av¬ 
erage lifetime) stores atoms where they accumulate; there 
are more atoms in the metastable state than in the ground 
state. This population inversion is maintained by radia- 
tively exciting atoms to a higher excitation state (“pump¬ 
ing”), from which they spontaneously jump down to the 
metastable state. When the atoms are illuminated by pho¬ 
tons with energies equal to the excitation energy of the 
metastable state, the photons will induce more radiation 
of the same wavelength, and the radiation is amplified in 
geometric progression 


are fewer atoms in excited states than in the 
ground state. There are, however, means to pro¬ 
duce a population inversion, an excited state con¬ 
taining more atoms than the ground state. This 
inversion is essential for both the maser and the 
laser (Microwave/Light Amplification by Stimu¬ 
lated Emission of Radiation). If the excited atoms 
are now illuminated with photons having ener¬ 
gies equal to the excitation energy, the radia¬ 
tion will induce downward transitions. The num¬ 
ber of photons emitted greatly exceeds the num¬ 
ber of absorbed photons, and radiation is ampli¬ 
fied. Typically the excited state is a metastable 
state, a state with a very long average lifetime, 
which means that the contribution of spontaneous 
emission is negligible. Therefore the resulting 
radiation is coherent and monochromatic. Sev¬ 
eral maser sources have been found in interstel¬ 
lar molecular clouds and dust envelopes around 
stars. 
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Synchrotron Radiation A free charge in ac¬ 
celerated motion will emit electromagnetic radi¬ 
ation. Charged particles moving in a magnetic 
field follow helices around the field lines. As seen 
from the direction of the field, the motion is circu¬ 
lar and therefore accelerated. The moving charge 
will radiate in the direction of its velocity vector. 
Such radiation is called synchrotron radiation. It 
will be further discussed in Chap. 16. 


5.10 Radiative Transfer 

Propagation of radiation in a medium, also called 
radiative transfer, is one of the basic problems of 
astrophysics. The subject is too complicated to 
be discussed here in any detail. The fundamental 
equation of radiative transfer is, however, easily 
derived. 

Assume we have a small cylinder, the bottom 
of which has an area dA and the length of which 
is dr. Let I v be the intensity of radiation perpen¬ 
dicular to the bottom surface going into a solid 
angle dco ([/„] = Wm -2 Hz -1 sterad -1 ). If the 
intensity changes by an amount d7 y in the dis¬ 
tance dr, the energy changes by 

d E — dl v dA dv dot d t 

in the cylinder in time dr. This equals the emis¬ 
sion minus absorption in the cylinder. The ab¬ 
sorbed energy is (cf. (4.14)) 


or 

dl v i v 

— r = -Iv+— • (5.37) 

a v dr a v 

We shall denote the ratio of the emission coeffi¬ 
cient j v to the absorption coefficient or opacity a. v 
by S v : 

S v =—. (5.38) 

a v 

S v is called the source function. Because a v dr — 
dr y , where r„ is the optical thickness at fre¬ 
quency v, (5.37) can be written as 

dl v 

= -I v + S v . (5.39) 

dr y 

Equation (5.39) is the basic equation of radia¬ 
tive transfer. Without solving the equation, we see 
that if 7y < S v , then d7 y /dr y > 0, and the inten¬ 
sity tends to increase in the direction of propa¬ 
gation. And, if I v > S v , then d7 y /dr y < 0, and 
I v will decrease. In an equilibrium the emitted 
and absorbed energies are equal, in which case 
we find from (5.35) and (5.36) 

I v = j v /a v = S v . (5.40) 

Substituting this into (5.39), we see that d7 y /dr y 
= 0. In thermodynamic equilibrium the radiation 
of the medium is blackbody radiation, and the 
source function is given by Planck’s law: 


d£ a bs = a y 7 y dr dAdv dwdt, (5.35) 

where a v is the opacity of the medium at fre¬ 
quency v. Let the amount of energy emitted per 
hertz at frequency v into unit solid angle from 
unit volume and per unit time be j v ([ /,,] = 
Wm -3 Hz -1 sterad -1 ). This is called the emis¬ 
sion coefficient of the medium. The energy emit¬ 
ted into solid angle d&> from the cylinder is then 

d£em = jv dr dA dv dcodt. (5.36) 

The equation 

d E = -d£a bs + d£em 

gives then 

d/„ = —a v I v dr + j v dr 


c 2 shv/(kT) _ j • 

Even if the system is not in thermodynamic equi¬ 
librium, it may be possible to find an excitation 
temperature T exc such that B v (T exc ) = S v . This 
temperature may depend on frequency. 

A formal solution of (5.39) is 

7 y (r y ) = 7 y (0)e -T " + [ V (T “ -,) S y (r)df. 

Jo 

(5.41) 

Here 7 y (0) is the intensity of the background radi¬ 
ation, coming through the medium (e.g. an inter¬ 
stellar cloud) and decaying exponentially in the 
medium. The second term gives the emission in 
the medium. The solution is only formal, since in 
general, the source function S v is unknown and 
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must be solved simultaneously with the intensity. 
If S v (r v ) is constant in the cloud and the back¬ 
ground radiation is ignored, we get 

Iv(r v ) = S V I' e- (r "- ?, d t = S„(l - e- T “). 

Jo 

(5.42) 

If the cloud is optically thick ( x v 'J>> 1), we have 
Iv = S v , (5.43) 


This expression will be used later in Chap. 8 on 
the interpretation of stellar spectra. 


5.11 Examples 

Example 5.1 Find the wavelength of the photon 
emitted in the transition of a hydrogen atom from 
n 2 = 110 to «i = 109. 

Equation (5.8) gives 


i.e. the intensity equals the source function, and 
the emission and absorption processes are in 
equilibrium. 

An important field of application of the theory 
of radiative transfer is in the study of planetary 
and stellar atmospheres. In this case, to a good 
approximation, the properties of the medium only 
vary in one direction, say along the z axis. The 
intensity will then depend only on z and 6 , where 
9 is the angle between the z axis and the direction 
of propagation of the radiation. 

In applications to atmospheres it is customary 
to define the optical depth x v in the vertical direc¬ 
tion as 


dr y = — a v dz. 

Conventionally z increases upwards and the opti¬ 
cal depth inwards in the atmosphere. The vertical 
line element dz is related to that along the light 
ray, dr, according to 

dz = dr cosd. 


With these notational conventions, (5.39) now 
yields 


cos 6 


d I v {z,0) 
dr„ 


= /v 


Sv. 


(5.44) 


This is the form of the equation of radiative trans¬ 
fer usually encountered in the study of stellar and 
planetary atmospheres. 

A formal expression for the intensity emerging 
from an atmosphere can be obtained by integrat¬ 
ing (5.44) from x v — oo (we assume that the bot¬ 
tom of the atmosphere is at infinite optical depth) 
to x v = 0 (corresponding to the top of the atmo¬ 
sphere). This yields 


HH) 

= 1.097 x 10 7 m -1 ( - T ) 

\ 109 2 no 2 / 

= 16.71 m“\ 
whence 


X — 0.060 m. 

This is in the radio band. Such radiation was ob¬ 
served for the first time in 1965 by an NRAO ra¬ 
dio telescope. 

Example 5.2 The effective temperature of a star 
is 12,000 K and the absolute bolometric magni¬ 
tude 0.0. Find the radius of the star, when the ef¬ 
fective temperature of the Sun is 5000 K and the 
absolute bolometric magnitude 4.7. 

We can apply (5.21): 

R T 

M bo i - M bo i,o = —5 lg -10lg —— 

Kq 1q 

, JL = ( Vio -a2(Mbo i- Mboi '0) 
Rq V T e ) 

_ /_5800 \ jq—0.2(0.0—4.7) 

\ 12,000 / 

= 2 . 0 . 

Thus the radius is twice the Solar radius. 

Example 5.3 Derive the Wien displacement 
laws. 

Let us denote x — hc/(XkT). Planck’s law 
then becomes 


Iv( 0,9) 



c „-r„sec@ 

Oy C 


seed dr y . 


(5.45) 


th(T) = 


2k 5 T 5 


/z 4 c 3 e- v — 1 
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For a given temperature, the first factor is con¬ 
stant. Thus, it is sufficient to find the maximum 
of the function /(a) — x 5 /(e v — 1). 

First we must evaluate the derivative of /: 


/'(*) 


5„.v 


5x 4 (e A — 1) — A'-’e- 
(e A - l) 2 

(5 - 5e _A 


x 4 e x 


(e v - 1) 


0 - 


or 

— = 1.701 x 10 -11 Ks. 

'•’max 

Note that the wavelength corresponding to v max 
is different from A max . The reason is that we have 
used two different forms of Planck’s function, 
one giving the intensity per unit wavelength, the 
other per unit frequency. 


By definition, x is always strictly positive. 
Hence f'(x) can be zero only if the factor 5 — 
5e _A — x is zero. This equation cannot be solved 
analytically. Instead we write the equation as x = 
5 — 5e _A and solve it by iteration: 

AO = 5 (this is just a guess), 

A! = 5 — 5e _A ° = 4.96631, 


Example 5.4 (a) Find the fraction of radiation 
that a blackbody emits in the range [k 1 , 7 . 2 ], 
where Ai and a 2 AE> A max . (b) How much energy 
does a 100 W incandescent light bulb radiate in 
the radio wavelengths, A, > 1 cm? Assume the 
temperature is 2500 K. 

Since the wavelengths are much longer than 
7.| nax we can use the Rayleigh-Jeans approxima¬ 
tion BxiT ) 2 ckT /A 4 . Then 


a 5 = 4.96511. 


Thus the result is a = 4.965. The Wien displace¬ 
ment law is then 

AmaxT = — —b — 2.898 x 10 -3 Km. 
a k 


In terms of frequency Planck’s law is 


B V (T) 


2hv 3 1 
c 2 e hv/(kT) _ l ■ 


Substituting x — hv / (kT) we get 


BAT) 


2k 3 T 3 x 3 
h 2 c 2 e A — 1 


Now we study the function fix) = A 3 /(e A — 1): 
3a 2 (e A — 1) — A 3 e A 


fix) = 


x 2 e x 


(e x - l) 2 


(e x - l) 2 

(3 — 3e~ x — a). 


This vanishes, when 3 — 3e A — a = 0. The solu¬ 
tion of this equation is a = 2.821. Hence 



and hence 

B' _ 5c 3 /! 3 1/1 1 \ 

Bf~ k 3 7t 4 T 3 \kf _ a|/ 

Now the temperature is T — 2500 K and the 
wavelength range [ 0.01 m, 00 ), and so 

B' = 100 W x 1.529 x 10 - 7 —i -——7 

2500 3 0.01 3 

= 9.8 x 10 “ 10 W. 

It is quite difficult to listen to the radio emission 
of a light bulb with an ordinary radio receiver. 

Example 5.5 (Determination of Effective Tem¬ 
perature) The observed flux density of Arcturus 
is 

F' = 4.5 x 10 “ 8 Wm“ 2 . 


cT 

Hnax 


— =b' = 5.100 x 10 -3 Km 
kx 


Interferometric measurements give an angular di¬ 
ameter of a — 0.020". Thus, a/2 = 4.85 x 10 -8 
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radians. From (5.26) we get 


T e .= 


4.5 X 1CT 


(4.85 x 10 -8 ) 2 x 5.669 x 10“ 8 


1/4 


K 


= 4300 K. 


Exercise 5.2 If the transition of the hydrogen 
atom n + I -* n were to correspond to the wave¬ 
length 21.05 cm, what would the quantum num¬ 
ber n be? The interstellar medium emits strong 
radiation at this wavelength. Can this radiation be 
due to such transitions? 


Example 5.6 Flux densities at the wavelengths 
440 nm and 550 nm are 1.30 and 1.00 Wm _2 m _1 , 
respectively. Find the colour temperature. 

If the flux densities at the wavelengths /. | and 
/.2 are F\ and F 2 , respectively, the colour temper¬ 
ature can be solved from the equation 

Fi B ki (T c ) /A 2 \ 5 e /lc /(A2^ c j _ i 
F 2 - B k2 (T c ) ~ Ui ) eWCi^c) - 1 ■ 

If we denote 


A = 


B i = 



he 



X\k' 


5 


Bi = 


he 
A. 2 k 


we get the equation 


e B 2 /T c _ 1 
A= qB\/T z _ l 


for the colour temperature T c . This equation must 
be solved numerically. 

In our example the constants have the follow¬ 
ing values: 


_ 1.00 /550\ 5 
~ F30 V440/ 


2.348, 


B\ = 32,700 K, Z?2 = 26,160 K. 


Exercise 5.3 The space is filled with back¬ 
ground radiation, remnant of the early age of the 
universe. Currently the distribution of this radi¬ 
ation is similar to the radiation of a blackbody 
at the temperature of 2.7 K. What is /. nlax corre¬ 
sponding to this radiation? What is its total in¬ 
tensity? Compare the intensity of the background 
radiation to the intensity of the Sun at the visual 
wavelengths. 

Exercise 5.4 The temperature of a red giant is 
T — 2500 K and radius 100 times the solar ra¬ 
dius. 

(a) Find the total luminosity of the star, and the 
luminosity in the visual band 400 nm < X < 
700 nm. 

(b) Compare the star with a 100 W lamp that ra¬ 
diates 5 % of its energy in the visual band. 
What is the distance of the lamp if it looks as 
bright as the star? 

Exercise 5.5 The effective temperature of Sir¬ 
ius is 10,000 K, apparent visual magnitude —1.5, 
distance 2.67 kpc and bolometric correction 0.5. 
What is the radius of Sirius? 

Exercise 5.6 The observed flux density of the 
Sun at A. — 300 nm is 0.59 W m -2 nm' 1 . Find the 
brightness temperature of the Sun at this wave¬ 
length. 


By substituting different values for T c , we find 
that T c = 7545 K satisfies our equation. 


Exercise 5.7 The colour temperature can be de¬ 
termined from two magnitudes corresponding to 
two different wavelengths. Show that 


5.12 Exercises 

Exercise 5.1 Show that in the Wien approxima¬ 
tion the relative error of B, is 

_ _ -he/(XkT) 

B k 


7000 K 

c ~ (B - V) + 0.47' 

The wavelengths of the B and V bands are 
440 nm and 548 nm, respectively, and we assume 
that B — V for stars of the spectral class A0, the 
colour temperature of which is about 15,000 K. 
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Exercise 5.8 The kinetic temperature of the Find the average speed of the electrons in such 
plasma in the solar corona can reach 10 6 K. a plasma. 




Celestial Mechanics 


6 


Celestial mechanics, the study of motions of ce¬ 
lestial bodies, together with spherical astronomy, 
was the main branch of astronomy until the end 
of the 19th century, when astrophysics began 
to evolve rapidly. The primary task of classical 
celestial mechanics was to explain and predict 
the motions of planets and their satellites. Sev¬ 
eral empirical models, like epicycles and Kepler’s 
laws, were employed to describe these motions. 
But none of these models explained why the plan¬ 
ets moved the way they did. It was only in the 
1680’s that a simple explanation was found for all 
these motions—Newton’s law of universal gravi¬ 
tation. In this chapter, we will derive some prop¬ 
erties of orbital motion. The physics we need for 
this is simple indeed, just Newton’s laws. (For 
a review, see Box 6 .1 .) 

This chapter is mathematically slightly more 
involved than the rest of the book. We shall use 
some vector calculus to derive our results, which, 
however, can be easily understood with very el¬ 
ementary mathematics. A summary of the basic 
facts of vector calculus is given in Appendix A. 4. 


6.1 Equations of Motion 

We shall concentrate on systems of only two bod¬ 
ies. In fact, this is the most complicated case that 
allows a neat analytical solution. For simplicity, 
let us call the bodies the Sun and a planet, al¬ 
though they could quite as well be a planet and 
its moon, or the two components of a binary star. 

Let the masses of the two bodies be m \ and m 2 
and the radius vectors in some fixed inertial coor- 



Fig.6.1 The radius vectors of the Sun and a planet in an 
arbitrary inertial frame are r\ and rj, and r = rj — r\ is 
the position of the planet relative to the Sun 

dinate frame r\ and rj (Fig. 6.1). The position 
of the planet relative to the Sun is denoted by 
r — ri — r \. According to Newton’s law of gravi¬ 
tation the planet feels a gravitational pull propor¬ 
tional to the masses m \ and m 2 and inversely pro¬ 
portional to the square of the distance r. Since the 
force is directed towards the Sun, it can be ex¬ 
pressed as 

Gnumi—r r 

F= -^- = —Gm\m2^, (6.1) 

r A r r i 

where G is the gravitational constant. (More 
about this in Sect. 6.5.) 

Newton’s second law tells us that the accelera¬ 
tion r 2 of the planet is proportional to the applied 
force: 

F — /77 2 i* 2 • (6.2) 

Combining (6.1) and (6.2), we get the equation of 
motion of the planet 

r 

m 2 r 2 = -Gmim 2 - 7 . (6.3) 
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Since the Sun feels the same gravitational pull, 
but in the opposite direction, we can immediately 
write the equation of motion of the Sun: 

r 

m\r\ = -\-Gm\m 2 — t. (6.4) 

yi 

We are mainly interested in the relative mo¬ 
tion of the planet with respect to the Sun. To find 
the equation of the relative orbit, we cancel the 
masses appearing on both sides of (6.3) and (6.4), 
and subtract (6.4) from (6.3) to get 

r 

r = -n ^r, (6.5) 

where we have denoted 

fx = G(m\ + 7712 )- ( 6 . 6 ) 

The solution of (6.5) now gives the relative or¬ 
bit of the planet. The equation involves the radius 
vector and its second time derivative. In princi¬ 
ple, the solution should yield the radius vector as 
a function of time, r = r(t). Unfortunately things 
are not this simple in practice; in fact, there is 
no way to express the radius vector as a function 
of time in a closed form (i.e. as a finite expres¬ 
sion involving familiar elementary functions). Al¬ 
though there are several ways to solve the equa¬ 
tion of motion, we must resort to mathematical 
manipulation in one form or another to figure out 
the essential properties of the orbit. Next we shall 
study one possible method. 


6.2 Solution of the Equation of 
Motion 

The equation of motion (6.5) is a second-order 
(i.e. contains second derivatives) vector valued 
differential equation. Therefore we need six in¬ 
tegration constants or integrals for the complete 
solution. The solution is an infinite family of or¬ 
bits with different sizes, shapes and orientations. 
A particular solution (e.g. the orbit of Jupiter) is 
selected by fixing the values of the six integrals. 
The fate of a planet is unambiguously determined 
by its position and velocity at any given moment; 
thus we could take the position and velocity vec¬ 
tors at some moment as our integrals. Although 



Fig. 6.2 The angular momentum vector k is perpendicu¬ 
lar to the radius and velocity vectors of the planet. Since 
k is a constant vector, the motion of the planet is restricted 
to the plane perpendicular to k 

they do not tell us anything about the geome¬ 
try of the orbit, they can be used as initial val¬ 
ues when integrating the orbit numerically with 
a computer. Another set of integrals, the orbital 
elements, contains geometric quantities describ¬ 
ing the orbit in a very clear and concrete way. We 
shall return to these later. A third possible set in¬ 
volves certain physical quantities, which we shall 
derive next. 

We begin by showing that the angular momen¬ 
tum remains constant. The angular momentum of 
the planet in the heliocentric frame is 

L — ni 2 r x r. (6.7) 

Celestial mechanicians usually prefer to use the 
angular momentum divided by the planet’s mass 

k = rxr. (6.8) 

Let us find the time derivative of this: 

k = rxr + rxr. 

The latter term vanishes as a vector product of 
two parallel vectors. The former term contains r, 
which is given by the equation of motion: 

k — rx (— ixr/r 3 ) = — (/r/r 3 )r x r — 0. 

Thus A: is a constant vector independent of time 
(as is L, of course). 

Since the angular momentum vector is always 
perpendicular to the motion (this follows from 
(6.8)), the motion is at all times restricted to the 
invariable plane perpendicular to k (Fig. 6.2). 
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Fig. 6.3 The radial velocity r is the projection of the ve¬ 
locity vector r in the direction of the radius vector r 


To find another constant vector, we compute 
the vector product k x r: 

k x r — (r x r) x (—/rr/r 3 ) 

= -^[{r ■ r)r - (r ■ r)r\ 

The time derivative of the distance r is equal to 
the projection of r in the direction of r (Fig. 6.3); 
thus, using the properties of the scalar product, 
we get r — r ■ r / r , which gives 

r ■ r — rr. (6.9) 


Hence, 

/ 9\ d 

k x r — —fi(r/r — rr Ir ) = —(— fir/r). 

' d t 

The vector product can also be expressed as 
d 

k x r = — (k x r), 

dr 

since A: is a constant vector. Combining this with 
the previous equation, we have 



Fig. 6.4 The orbit of an object in the gravitational field of 
another object is a conic section: ellipse, parabola or hy¬ 
perbola. Vector e points to the direction of the pericentre, 
where the orbiting object is closest to central body. If the 
central body is the Sun, this direction is called the perihe¬ 
lion; if some other star, periastron; if the Earth, perigee, 
etc. The true anomaly / is measured from the pericentre 


where the planet is closest to the Sun in its orbit. 
This point is called the perihelion. 

One more constant is found by computing rr: 

r r — —fir ■ r/r 3 = — firr/r 3 
^ d 

= ~l ir / r ~ = -r-M/r). 


and 


— (k x r + fir /r) = 0 


Since we also have 


r ■ r 


d 

dr 



k x r + fir/r — const = — fie. (6.10) we get 

Since k is perpendicular to the orbital plane, 
k x r must lie in that plane. Thus, e is a linear 
combination of two vectors in the orbital plane; or 
so e itself must be in the orbital plane (Fig. 6.4). 

Later we shall see that it points to the direction 


d 

dr 





— fi/r — const = h. 


( 6 . 11 ) 
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Here v is the speed of the planet relative to the 
Sun. The constant h is called the energy integral ; 
the total energy of the planet is ni 2 h. We must not 
forget that energy and angular momentum depend 
on the coordinate frame used. Here we have used 
a heliocentric frame, which in fact is in acceler¬ 
ated motion. 

So far, we have found two constant vectors and 
one constant scalar. It looks as though we already 
have seven integrals, i.e. one too many. But not all 
of these constants are independent; specifically, 
the following two relations hold: 

k ■ e = 0, (6.12) 

p 2 (e 2 — l) = 2hk 2 , (6.13) 

where e and k are the lengths of e and k. The 
first equation is obvious from the definitions of e 
and k. To prove (6.13), we square both sides of 
(6.10) to get 

9 t 9 r ■ r pr 

p~e~ = (k x r) ■ (k x r) + p — y~ + 2 (kxr) - . 

r l r 

Since k is perpendicular to r, the length of 
k x r is |£| \r | = kv and (k x r) ■ (k x r) — k 2 v 2 . 
Thus, we have 

p 2 e 2 = k 2 v 2 + /x 2 + —{k x r ■ r). 

r 

The last term contains a scalar triple product, 
where we can exchange the dot and cross to get 
k ■ r x r. Next we reverse the order of the two 
last factors. Because the vector product is anti- 
commutative, we have to change the sign of the 
product: 


planet is! To fix its position in the orbit, we have 
to determine where the planet is at some given in¬ 
stant of time t = to, or alternatively, at what time 
it is in some given direction. We use the latter 
method by specifying the time of perihelion pas¬ 
sage, the time of perihelion r. 


6.3 Equation of the Orbit and 
Kepler's First Law 

In order to find the geometric shape of the orbit, 
we now derive the equation of the orbit. Since e is 
a constant vector lying in the orbital plane, we 
choose it as the reference direction. We denote the 
angle between the radius vector r and e by /. The 
angle / is called the true anomaly. (There is noth¬ 
ing false or anomalous in this and other anomalies 
we shall meet later. Angles measured from the 
perihelion point are called anomalies to distin¬ 
guish them from longitudes measured from some 
other reference point, usually the vernal equinox.) 
Using the properties of the scalar product we get 

r ■ e = re cos /. 

But the product r ■ e can also be evaluated using 
the definition of e: 

r ■ e = - (r ■ k x r + pr ■ r/ r) 

/x 

1 1 . 9 

=- (k r x r + pr) = - l—k + pr) 

_e_ 

p. 


p 2 (e 2 — l) = k 2 v 2 - —{k ■ r x r) 

= k 2 V 2 — 2 ——k 2 
r 

= 2 k 2 (^v 2 - = 2k 2 h. 

This completes the proof of (6.13). 

The relations (6.12) and (6.13) reduce the 
number of independent integrals by two, so we 
still need one more. The constants we have de¬ 
scribe the size, shape and orientation of the or¬ 
bit completely, but we do not yet know where the 


Equating the two expressions of r ■ e we get 


k 2 /p 

1 + ecos f 


(6.14) 


This is the general equation of a conic section in 
polar coordinates (Fig. 6.4; see Appendix A.2 for 
a brief summary of conic sections). The magni¬ 
tude of e gives the eccentricity of the conic: 


e = 0 circle, 

0 < e < 1 ellipse, 
e = 1 parabola, 

e > 1 hyperbola. 
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Fig. 6.5 Six integration constants are needed to describe 
a planet’s orbit. These constants can be chosen in vari¬ 
ous ways, (a) If the orbit is to be computed numerically, 
the simplest choice is to use the initial values of the ra¬ 
dius and velocity vectors, (b) Another possibility is to use 
the angular momentum k , the direction of the perihelion e 

Inspecting (6.14), we find that r attains its mini¬ 
mum when f — 0, i.e. in the direction of the vec¬ 
tor e. Thus, e indeed points to the direction of the 
perihelion. 

Starting with Newton’s laws, we have thus 
managed to prove Kepler’s first law: 

The orbit of a planet is an ellipse, one focus of 

which is in the Sun. 

Without any extra effort, we have shown that 
also other conic sections, the parabola and hyper¬ 
bola, are possible orbits. 


6.4 Orbital Elements 

We have derived a set of integrals convenient for 
studying the dynamics of orbital motion. We now 
turn to another collection of constants more ap¬ 
propriate for describing the geometry of the orbit. 
The following six quantities are called the orbital 
elements (Fig. 6.5): 

- semimajor axis a, 

- eccentricity e, 

- inclination i (or i), 

- longitude of the ascending node Q, 

- argument of the perihelion co, 

- time of the perihelion r. 

The eccentricity is obtained readily as the 
length of the vector e. From the equation of the 


(the length of which gives the eccentricity), and the peri¬ 
helion time r. (c) The third method best describes the ge¬ 
ometry of the orbit. The constants are the longitude of the 
ascending node Q , the argument of perihelion a), the in¬ 
clination i, the semimajor axis a, the eccentricity e and 
the time of perihelion r 

orbit (6.14), we see that the parameter (or semi- 
latus rectum) of the orbit is p = k 2 / p. But the 
parameter of a conic section is always a\ 1 — e 2 \, 
which gives the semimajor axis, if e and k are 
known: 


By applying (6.13), we get an important relation 
between the size of the orbit and the energy inte¬ 
gral h : 

I —p/2h , if the orbit is an ellipse, 
p/2h, if the orbit is a hyperbola. 

(6.16) 

For a bound system (elliptical orbit), the total 
energy and the energy integral are negative. For 
a hyperbolic orbit h is positive; the kinetic en¬ 
ergy is so high that the particle can escape the sys¬ 
tem (or more correctly, recede without any limit). 
The parabola, with h = 0, is a limiting case be¬ 
tween elliptical and hyperbolic orbits. In reality 
parabolic orbits do not exist, since hardly any 
object can have an energy integral exactly zero. 
However, if the eccentricity is very close to one 
(as with many comets), the orbit is usually con¬ 
sidered parabolic to simplify calculations. 

The orientation of the orbit is determined by 
the directions of the two vectors k (perpendicular 
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to the orbital plane) and e (pointing towards the 
perihelion). The three angles i, £2 and co contain 
the same information. 

The inclination i gives the obliquity of the or¬ 
bital plane relative to some fixed reference plane. 
For bodies in the solar system, the reference plane 
is usually the ecliptic. For objects moving in the 
usual fashion, i.e. counterclockwise, the inclina¬ 
tion is in the interval [0°, 90°]; for retrograde or¬ 
bits (clockwise motion), the inclination is in the 
range (90°, 180°]. For example, the inclination of 
Halley’s comet is 162°, which means that the mo¬ 
tion is retrograde and the angle between its orbital 
plane and the ecliptic is 180° — 162° =18°. 

The longitude of the ascending node, £2 , indi¬ 
cates where the object crosses the ecliptic from 
south to north. It is measured counterclockwise 
from the vernal equinox. The orbital elements i 
and £2 together determine the orientation of the 
orbital plane, and they correspond to the direction 
of k, i.e. the ratios of its components. 

The argument of the perihelion co gives the 
direction of the perihelion, measured from the 
ascending node in the direction of motion. The 
same information is contained in the direction 
of e. Very often another angle, the longitude of 
the perihelion ur (pronounced as pi), is used in¬ 
stead of a). It is defined as 

ur = £2 + co. (6.17) 

This is a rather peculiar angle, as it is measured 
partly along the ecliptic, partly along the orbital 
plane. However, it is often more practical than 
the argument of perihelion, since it is well de¬ 
fined even when the inclination is close to zero 
in which case the direction of the ascending node 
becomes indeterminate. 

We have assumed up to this point that each 
planet forms a separate two-body system with the 
Sun. In reality planets interfere with each other by 
disturbing each other’s orbits. Still their motions 
do not deviate very far from the shape of conic 
sections, and we can use orbital elements to de¬ 
scribe the orbits. But the elements are no longer 
constant; they vary slowly with time. Moreover, 
their geometric interpretation is no longer quite 
as obvious as before. Such elements are osculat¬ 
ing elements that would describe the orbit if all 



Fig. 6.6 Unit vectors e r and Cf of the polar coordinate 
frame. The directions of these change while the planet 
moves along its orbit 

perturbations were to suddenly disappear. They 
can be used to find the positions and velocities of 
the planets exactly as if the elements were con¬ 
stants. The only difference is that we have to use 
different elements for each moment of time. 

Table C.12 (at the end of the book) gives 
the mean orbital elements for the nine planets 
for the epoch J2000.0 as well as their first time 
derivatives. In addition to these secular varia¬ 
tions the orbital elements suffer from periodic 
disturbations, which are not included in the ta¬ 
ble. Thus only approximate positions can be cal¬ 
culated with these elements. Instead of the time 
of perihelion the table gives the mean longitude 

L = M + co+ £2, (6.18) 

which gives directly the mean anomaly M (which 
will be defined in Sect. 6.7). 


6.5 Kepler's Second and Third Law 

The radius vector of a planet in polar coordinates 
is simply 

r — re r , (6.19) 

where e r is a unit vector parallel with r (Fig. 6.6). 
If the planet moves with angular velocity /, the 
direction of this unit vector also changes at the 
same rate: 

e r = fe f . 


( 6 . 20 ) 
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where ef is a unit vector perpendicular to e r . The 
velocity of the planet is found by taking the time 
derivative of (6.19): 

r = re r + re r = re r + rfef. (6.21) 

The angular momentum k can now be evaluated 
using (6.19) and (6.21): 

k = rxr = r 2 fe z , (6.22) 

where e z is a unit vector perpendicular to the or¬ 
bital plane. The magnitude of k is 



k = r 2 f. (6.23) 

The surface velocity of a planet means the area 
swept by the radius vector per unit of time. This 
is obviously the time derivative of some area, so 
let us call it A. In terms of the distance r and true 
anomaly /, the surface velocity is 

A= X -r 2 f. (6.24) 

By comparing this with the length of k (6.23), we 
find that 

A = -k. (6.25) 

2 

Since k is constant, so is the surface velocity. 
Hence we have Kepler’s second law: 

The radius vector of a planet sweeps equal areas in 
equal amounts of time. 


Fig. 6.7 The areas of the shaded sectors of the ellipse are 
equal. According to Kepler’s second law, it takes equal 
times to travel distances AB, CD and EF 

where P is the orbital period. Since the area of 
the ellipse is 

nab = TtcrsJ 1 — e 2 , (6.28) 

where a and b are the semimajor and semiminor 
axes and e the eccentricity, we get 

na 2 s/l - e 2 = hcP. (6.29) 

To find the length of k, we substitute the energy 
integral h as a function of semimajor axis (6.16) 
into (6.13) to get 

k = yjG(m\ + m 2 )a (1 — e 2 ). (6.30) 


Since the Sun-planet distance varies, the or¬ 
bital velocity must also vary (Fig. 6.7). From Ke¬ 
pler’s second law it follows that a planet must 
move fastest when it is closest to the Sun (near 
perihelion). Motion is slowest when the planet is 
farthest from the Sun at aphelion. 

We can write (6.25) in the form 


When this is substituted into (6.29) we have 


P 2 = 


4;r 2 


G(in i + m 2) 


(6.31) 


This is the exact form of Kepler’s third law as 
derived from Newton’s laws. The original version 
was 


1 

d A = -kdt , 
2 


The ratio of the cubes of the semimajor axes of 
(6.26) the orbits of two planets is equal to the ratio of the 
squares of their orbital periods. 


and integrate over one complete period: 


L 


d A = - 


orbital ellipse 


1 r 1 

2 Jo 


dr, 


In this form the law is not exactly valid, even 
for planets of the solar system, since their own 
(6 27) masses influence their periods. The errors due to 
ignoring this effect are very small, however. 
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Kepler’s third law becomes remarkably simple 
if we express distances in astronomical units (au), 
times in sidereal years (the abbreviation is unfor¬ 
tunately a, not to be confused with the semimajor 
axis, denoted by a somewhat similar symbol a) 
and masses in solar masses (Mq). Then G = Arc 2 
and 

a 2 = (m\ + m 2 )P 2 . (6.32) 

The masses of objects orbiting around the Sun 
can safely be ignored (except for the largest plan¬ 
ets), and we have the original law P 2 — a 2 . This 
is very useful for determining distances of var¬ 
ious objects whose periods have been observed. 
For absolute distances we have to measure at least 
one distance in metres to find the length of one 
au. Earlier, triangulation was used to measure the 
parallax of the Sun or a minor planet, such as 
Eros, that comes very close to the Earth. Nowa¬ 
days, radiotelescopes are used as radar to very 
accurately measure, for example, the distance to 
Venus. Since changes in the value of one au also 
change all other distances, the International As¬ 
tronomical Union decided in 1968 to adopt the 
value 1 au= 1.496000 x 10 11 m. The semimajor 
axis of Earth’s orbit is then slightly over one au. 
But constants tend to change. And so, after 1984, 
the astronomical unit has a new value, 

1 au = 1.49597870 x 10 11 m. 

This corresponds to the radius of an object that 
has the same orbital period as the Earth but that 
has no mass. Since also the Earth’s mass affects 
its period, the actual semimajor axis of the Earth 
must be slightly bigger than one au. 

Another important application of Kepler’s 
third law is the determination of masses. By ob¬ 
serving the period of a natural or artificial satel¬ 
lite, the mass of the central body can be obtained 
immediately. The same method is used to deter¬ 
mine masses of binary stars (more about this sub¬ 
ject in Chap. 10). 

Although the values of the au and year are ac¬ 
curately known in Si-units, the gravitational con¬ 
stant is known only approximately. Astronomical 
observations give the product G(m i + m 2 ), but 
there is no way to distinguish between the con¬ 
tributions of the gravitational constant and those 


of the masses. The gravitational constant must be 
measured in the laboratory; this is very difficult 
because of the weakness of gravitation. There¬ 
fore, if a precision higher than 2-3 significant 
digits is required, the Si-units cannot be used. 
Instead we have to use the solar mass as a unit 
of mass (or, for example, the Earth’s mass after 
G»i© has been determined from observations of 
satellite orbits). 


6.6 Systems of Several Bodies 


This far we have discussed systems consisting of 
only two bodies. In fact it is the most complex 
system for which a complete solution is known. 
The equations of motion are easily generalised, 
though. As in (6.5) we get the equation of motion 
for the body k,k = 1,...,«: 


i = n 

ft = '^2, Gm > 

i=ij^k 


n - r k 

I rt - r k I 3 ’ 


(6.33) 


where m; is the mass of the r th body and r, its 
radius vector. On the right hand side of the equa¬ 
tion we now have the total gravitational force 
due to all other objects, instead of the force of 
just one body. If there are more than two bodies, 
these equations cannot be solved analytically in 
a closed form (Fig. 6.8). The only integrals that 
can be easily derived in the general case are the 
total energy, total momentum, and total angular 
momentum. 

If the radius and velocity vectors of all bodies 
are known for a certain instant of time, the posi¬ 
tions at some other time can easily be calculated 
numerically from the equations of motion. For 
example, the planetary positions needed for as¬ 
tronomical yearbooks are computed by integrat¬ 
ing the equations numerically. 

Another method can be applied if the grav¬ 
ity of one body dominates like in the solar sys¬ 
tem. Planetary orbits can then be calculated as 
in a two-body system, and the effects of other 
planets taken into account as small perturbations. 
For these perturbations several series expansions 
have been derived. 

The restricted three-body problem is an exten¬ 
sively studied special case. It consists of two mas¬ 
sive bodies or primaries, moving on circular or- 





6.7 Orbit Determination 


131 


S: 



Fig. 6.8 When a system consists of more than two bodies, 
the equations of motion cannot be solved analytically. In 
the solar system the mutual disturbances of the planets are 
usually small and can be taken into account as small per¬ 
turbations in the orbital elements. K.F. Sundman designed 
a machine to carry out the tedious integration of the pertur¬ 
bation equations. This machine, called the perturbograph, 
is one of the earliest analogue computers; unfortunately 
it was never built. Shown is a design for one component 
that evaluates a certain integral occurring in the equations. 
(The picture appeared in K.F. Sundman's paper in Fest- 
skrift tillegnad Anders Donner in 1915) 


bits around each other, and a third, massless body, 
moving in the same plane with the primaries. This 
small object does in no way disturb the motion of 
the primaries. Thus the orbits of the massive bod¬ 
ies are as simple as possible, and their positions 
are easily computed for all times. The problem is 
to find the orbit of the third body. It turns out that 
there is no finite expression for this orbit. 

The Finnish astronomer Karl Frithiof Sund¬ 
man (1873-1949) managed to show that a solu¬ 
tion exists and derive a series expansion for the 
orbit. The series converges so slowly that it has 



Fig. 6.9 The Lagrangian points of the restricted three- 
body problem. The points L\, Z -2 and L 3 are on the same 
line with the primaries, but the numbering may vary. The 
points L 4 and L 5 form equilateral triangles with the pri¬ 
maries 

no practical use, but as a mathematical result it 
was remarkable, since many mathematicians had 
for a long time tried to attack the problem without 
success. 

The three-body problem has some interesting 
special solutions. It can be shown that in certain 
points the third body can remain at rest with re¬ 
spect to the primaries. There are five such points, 
known as the Lagrangian points 
(Fig. 6.9). Three of them are on the straight line 
determined by the primaries. These points are un¬ 
stable: if a body in any of these points is dis¬ 
turbed, it will escape. The two other points, on 
the other hand, are stable. These points together 
with the primaries form equilateral triangles. For 
example, some asteroids have been found around 
the Lagrangian points L 4 and L$ of Jupiter. The 
first of them were named after heroes of the Tro¬ 
jan war, and so they are called Trojan asteroids. 
They move around the Lagrangian points and 
can actually travel quite far from them, but they 
cannot escape. Figure 8.38 shows two distinct 
condensations around the Lagrangian points of 
Jupiter. Later similar Trojan asteroids have been 
found also on the orbits of other planets. 


6.7 Orbit Determination 

Celestial mechanics has two very practical tasks: 
to determine orbital elements from observations 
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and to predict positions of celestial bodies with 
known elements. Planetary orbits are already 
known very accurately, but new comets and mi¬ 
nor planets are found frequently, requiring orbit 
determination. 

The first practical methods for orbit determi¬ 
nation were developed by Johann Karl Friedrich 
Gauss (1777-1855) at the beginning of the 19th 
century. By that time the first minor planets had 
been discovered, and thanks to Gauss’s orbit de¬ 
terminations, they could be found and observed 
at any time. 

At least three observations are needed for 
computing the orbital elements. The directions 
are usually measured from pictures taken a few 
nights apart. Using these directions, it is possible 
to find the corresponding absolute positions (the 
rectangular components of the radius vector). To 
be able to do this, we need some additional con¬ 
straints on the orbit; we must assume that the ob¬ 
ject moves along a conic section lying in a plane 
that passes through the Sun. When the three ra¬ 
dius vectors are known, the ellipse (or some other 
conic section) going through these three points 
can be determined. In practice, more observations 
are used. The elements determined are more ac¬ 
curate if there are more observations and if they 
cover the orbit more completely. 

Although the calculations for orbit determi¬ 
nation are not too involved mathematically, they 
are relatively long and laborious. Several meth¬ 
ods can be found in textbooks of celestial me¬ 
chanics. 


6.8 Position in the Orbit 

Although we already know everything about the 
geometry of the orbit, we still cannot find the 
planet at a given time, since we do not know the 
radius vector r as a function of time. The variable 
in the equation of the orbit is an angle, the true 
anomaly /, measured from the perihelion. From 
Kepler’s second law it follows that / cannot in¬ 
crease at a constant rate with time. Therefore we 
need some preparations before we can find the ra¬ 
dius vector at a given instant. 

The radius vector can be expressed as 

r — a(cos E — e)i + b sin Ej, (6.34) 



Fig. 6.10 Definition of the eccentric anomaly E. The 
planet is at P, and r is its radius vector 

where i and j are unit vectors parallel with the 
major and minor axes, respectively. The angle E 
is the eccentric anomaly, its slightly eccentric 
definition is shown in Fig. 6.10. Many formulas 
of elliptical motion become very simple if either 
time or true anomaly is replaced by the eccentric 
anomaly. As an example, we take the square of 
(6.34) to find the distance from the Sun: 

2 

r =r r 

= a 2 (cos E — e) 2 + b 2 sin 2 E 
= a 2 [(cos E — e) 2 + (l — e 2 )(l — cos 2 £)] 

= a 2 [l — 2ecos E + e 2 cos 2 E], 
whence 

r — a (1 — ecos£). (6.35) 

Our next problem is to find how to calculate E 
for a given moment of time. According to Ke¬ 
pler’s second law, the surface velocity is constant. 
Thus the area of the shaded sector in Fig. 6.10 is 

A = nab , (6.36) 

where t — r is the time elapsed since the peri¬ 
helion, and P is the orbital period. But the area 
of a part of an ellipse is obtained by reducing 
the area of the corresponding part of the circum¬ 
scribed circle by the axial ratio b/a. (As the math- 
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ematicians say, an ellipse is an affine transfor¬ 
mation of a circle.) Hence the area of SPX is 
(Fig. 6.11) 

A = — (area of SP'X ) 
a 

b , . 

= — (area of the sector CP X 
a 

— area of the triangle CP'S) 

b( 1 1 . \ 

= — 1 —a ■ aE - ae ■ a sin E 

a \2 2 ) 

1 

= -ab(E — esinZs). 

By equating these two expressions for the area A, 
we get the famous Kepler’s equation , 



Fig. 6.11 The area of the shaded sector equals b/a times 
the area SP'X. S = the Sun, P = the planet, X = the peri¬ 
helion 


Zs — esine = M, (6.37) 


where 

M= 2 ^{t- r) (6.38) 

is the mean anomaly of the planet at time t. The 
mean anomaly increases at a constant rate with 
time. It indicates where the planet would be if it 
moved in a circular orbit of radius a. For circular 
orbits all three anomalies /, E, and M are always 
equal. 

If we know the period and the time elapsed 
after the perihelion, we can use (6.38) to find the 
mean anomaly. Next we must solve for the eccen¬ 
tric anomaly from Kepler’s equation (6.37). Fi¬ 
nally the radius vector is given by (6.35). Since 
the components of r expressed in terms of the 
true anomaly are r cos / and r sin /, we find 


cos / = 


sin / = 


a(cosZs — e) cos E — e 
r 1 — e cos E ’ 

b sin E r - - sin E 

r 1 — e cos E 


(6.39) 


These determine the true anomaly, should it be of 
interest. 

Now we know the position in the orbital plane. 
This must usually be transformed to some other 
previously selected reference frame. For exam¬ 
ple, we may want to know the ecliptic longitude 
and latitude, which can later be used to find the 


right ascension and declination. These transfor¬ 
mations belong to the realm of spherical astron¬ 
omy and are briefly discussed in Examples 6.5- 
6.7. 


6.9 Escape Velocity 


If an object moves fast enough, it can escape from 
the gravitational field of the central body (to be 
precise: the field extends to infinity, so the object 
never really escapes, but is able to recede without 
any limit). If the escaping object has the mini¬ 
mum velocity allowing escape, it will have lost 
all its velocity at infinity (Fig. 6.12). There its ki¬ 
netic energy is zero, since v — 0, and the potential 
energy is also zero, since the distance r is infinite. 
At infinite distance the total energy as well as the 
energy integral h are zero. The law of conserva¬ 
tion of energy gives, then: 

It M 

-ir-^ = 0, (6.40) 

Z K 

where R is the initial distance at which the object 
is moving with velocity v. From this we can solve 
the escape velocity. 



j2G(mi + m 2 ) 
V R 


(6.41) 


For example on the surface of the Earth, v c is 
about 11 km/s (if m 2 <K m ©). 
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Fig. 6.12 A projectile is 
shot horizontally from 
a mountain on an 
atmosphereless planet. If 
the initial velocity is small, 
the orbit is an ellipse whose 
pericentre is inside the 
planet, and the projectile 
will hit the surface of the 
planet. When the velocity 
is increased, the pericentre 
moves outside the planet. 
When the initial velocity 
is u c , the orbit is circular. If 
the velocity is increased 
further, the eccentricity of 
the orbit grows again and 
the pericentre is at the 
height of the cannon. The 
apocentre moves further 
away until the orbit 
becomes parabolic when 
the initial velocity is v e . 
With even higher 
velocities, the orbit 
becomes hyperbolic 



The escape velocity can also be expressed us¬ 
ing the orbital velocity of a circular orbit. The or¬ 
bital period P as a function of the radius R of the 
orbit and the orbital velocity v c is 

2jtR 

P =-. 

Vc 

Substitution into Kepler’s third law yields 

4 :t 2 R 2 _ 4 n 2 R i 
v 2 G{m\+m2) 

From this we can solve the velocity v c in a circu¬ 
lar orbit of radius R: 

lG(mi+m 2 ) 

Vc = y - r -' ( 6 - 42 ) 

Comparing this with the expression (6.41) of the 
escape velocity, we see that 


6.10 Virial Theorem 

If a system consists of more than two objects, the 
equations of motion cannot in general be solved 
analytically (Fig. 6.12). Given some initial val¬ 
ues, the orbits can, of course, be found by numer¬ 
ical integration, but this does not tell us anything 
about the general properties of all possible orbits. 
The only integration constants available for an ar¬ 
bitrary system are the total momentum, angular 
momentum and energy. In addition to these, it is 
possible to derive certain statistical results, like 
the virial theorem. It concerns time averages only, 
but does not say anything about the actual state of 
the system at some specified moment. 

Suppose we have a system of n point masses 
nij with radius vectors r, and velocities r, . We 
define a quantity A (the “virial” of the system) as 
follows: 

n 

A = 'Y^m i r i 
1 = 1 


v e = ~Jlv c - 


(6.43) 


(6.44) 
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The time derivative of this is 

n 

A = y>,T, • r, + niifj ■ r t ). (6.45) 

i = 1 


The first term equals twice the kinetic energy 
of the ;th particle, and the second term contains 
a factor m,r ( - which, according to Newton’s laws, 
equals the force applied to the i th particle. Thus 
we have 

n 

A = 2T + J2 F i-ri, (6.46) 

i = i 

where T is the total kinetic energy of the system. 
If ( x) denotes the time average of x in the time 
interval [0, r], we have 


(A) = — [ Adt — (2T) + 
T Jo 



(6.47) 

If the system remains bounded, i.e. none of the 
particles escapes, all r,-’ s as well as all velocities 
will remain bounded. In such a case, A does not 
grow without limit, and the integral of the pre¬ 
vious equation remains finite. When the time in¬ 
terval becomes longer (r —»■ oo), (A) approaches 
zero, and we get 


(2T)+^J2 F <- r ij=°- (6-48) 

This is the general form of the virial theorem. If 
the forces are due to mutual gravitation only, they 
have the expressions 

n 

F i = —Grrii ]T mj- 1 . (6.49) 

j = hj?i r ‘J 

where r\j = | r, — r j |. The latter term in the virial 
theorem is now 


n n 


= ~ g J 2 J2 m ' m i ',.3 J -(n-rj), 

i = 1 f = / + l 


r ij 


where the latter form is obtained by rearranging 
the double sum, combining the terms 


1 J 3 
r. 
'J 


and 


r j — ri ri — r / 

m jirij —=- r j = m,mj -3— • (-r ; ). 

r.. r •• 

)i v 

Since (/*/ — r j) • (r/ — r j) = rfj the sum reduces 
to 

n n 

E \ > tJJ i TTl j 

E 

i = 1 j = i + l l J 

where (7 is the potential energy of the system. 
Thus, the virial theorem becomes simply 


(T) = ~\(U). 


(6.50) 


6.11 The Jeans Limit 

We shall later study the birth of stars and galax¬ 
ies. The initial stage is, roughly speaking, a gas 
cloud that begins to collapse due to its own gravi¬ 
tation. If the mass of the cloud is high enough, its 
potential energy exceeds the kinetic energy and 
the cloud collapses. From the virial theorem we 
can deduce that the potential energy must be at 
least twice the kinetic energy. This provides a cri¬ 
terion for the critical mass necessary for the cloud 
of collapse. This criterion was first suggested by 
Sir James Jeans in 1902. 

The critical mass will obviously depend on the 
pressure P and density p. Since gravitation is the 
compressing force, the gravitational constant G 
will probably also enter our expression. Thus the 
critical mass is of the form 


E f '-6 

i = l 

= it, it, m ' m J r ^- i ■ r ‘ 

i = 1 7 = r 'J 


M = CP a G b p c , (6.51) 

where C is a dimensionless constant, and the con¬ 
stants a , b and c are determined so that the right- 
hand side has the dimension of mass. The dimen¬ 
sion of pressure is kgm -1 s -2 , of gravitational 
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constant kg 1 m 3 s 2 and of density kg m 3 . 
Thus the dimension of the right-hand side is 

kg (a-b+c) m (—a+3fc—3c) s (-2a-2b) 

Since this must be kilograms ultimately, we get 
the following set of equations: 


a — b + c = 1, — a + 3b — 3c = 0, 

— 2 a — 2b — 0. 


The solution of this is a = 3/2, b — —3/2 and 
c = —2. Hence the critical mass is 


M] = C 


p 3/2 

G^V' 


(6.52) 


This is called the Jeans mass. In order to deter¬ 
mine the constant C, we naturally must calcu¬ 
late both kinetic and potential energy. Another 
method based on the propagation of waves de¬ 
termines the diameter of the cloud, the Jeans 
length X], by requiring that a disturbance of size 
kj grow unbounded. The value of the constant C 
depends on the exact form of the perturbation, but 
its typical values are in the range We 

can take C = 1 as well, in which case (6.52) gives 
a correct order of magnitude for the critical mass. 
If the mass of a cloud is much higher than Mj, it 
will collapse by its own gravitation. 

In (6.52) the pressure can be replaced by the 
kinetic temperature 7/ of the gas (see Sect. 5.8 for 
a definition). According to the kinetic gas theory, 
the pressure is 


P = nkT k , (6.53) 

where n is the number density (particles per unit 
volume) and k is Boltzmann’s constant. The num¬ 
ber density is obtained by dividing the density of 
the gas p by the average molecular weight p : 


n = p/p., 


Box 6.1 (Newton’s Laws) 

1. In the absence of external forces, a particle 
will remain at rest or move along a straight 
line with constant speed. 

2. The rate of change of the momentum of 
a particle is equal to the applied force F : 

d 

p — —( mv ) = F. 

At 

3. If particle A exerts a force F on another par¬ 
ticle B, B will exert an equal but opposite 
force — F on A. 

If several forces F F 2, ... are applied on 
a particle, the effect is equal to that caused by 
one force F which is the vector sum of the in¬ 
dividual forces {F = F\ + Fn H-)■ 

Law of gravitation'. If the masses of particles 
A and B are m 4 and m n and their mutual dis¬ 
tance r, the force exerted on A by B is directed 
towards B and has the magnitude Gin a mg/r 2 , 
where G is a constant depending on the units 
chosen. 

Newton denoted the derivative of a func¬ 
tion / by / and the integral function by f. The 
corresponding notations used by Leibniz were 
d//dr and f f dx. Of Newton’s notations, only 
the dot is still used, always signifying the time 
derivative: / = df/dt. For example, the veloc¬ 
ity r is the time derivative of r, the acceleration 
r its second derivative, etc. 


6.12 Examples 

Example 6.1 Find the orbital elements of Jupiter 
on August 23, 1996. 

The Julian date is 2,450,319, hence from 
(6.17), T — —0.0336. By substituting this into 
the expressions of Table C.12, we get 


whence 

P = pkT k /p. 

By substituting this into (6.52) we get 


Mj = C 


fkJV 

\pG 


1 

sfp' 


(6.54) 


a = 5.2033, 
e = 0.0484, 
i = 1.3053°, 

(2 = 100.5448°, 
nr= 14.7460°, 


3/2 
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L = —67.460° = 292.540°. 

From these we can compute the argument of per¬ 
ihelion and mean anomaly: 


co = nr - Q = -85.7988° = 274.201°, 
M=L-m= -82.2060° = 277.794°. 

Example 6.2 (Orbital Velocity) (a) Comet Aus¬ 
tin (1982g) moves in a parabolic orbit. Find its 
velocity on October 8, 1982, when the distance 
from the Sun was 1.10 au. 

The energy integral for a parabola is h — 0. 
Thus (6.11) gives the velocity v: 

Fiji _ Iigmq 
V r V r 


The period is obtained from Kepler’s third 
law: 


G(m i + m2) 

4tt 2 1 

~ 6.67 x 10— 11 (5 + 5) 
= 5.9 x 10 10 s 2 , 


whence 


P = 243,000 s = 2.8 d. 

Example 6.4 The period of the Martian moon 
Phobos is 0.3189 d and the radius of the orbit 
9370 km. What is the mass of Mars? 

First we change to more appropriate units: 


2 x 47r 2 x 1 


= 8.47722 au/a 


V 1.10 

8.47722 x 1.496 x 10 11 m 
365.2564 x 24 x 3600 s 


40 km/s. 


P = 0.3189 d = 0.0008731 sidereal years, 
a = 9370 km = 6.2634 x 10“ 5 au. 

Equation (6.32) gives (it is safe to assume that 

m Phobos ^ TMMars) 


(b) The semimajor axis of the minor planet 
1982RA is 1.568 au and the distance from the 
Sun on October 8, 1982, was 1.17 au. Find its 
velocity. 

The energy integral (6.16) is now 
h — —fi/2a. 

Hence 

1 2 M M 

2 r 2 a' 

which gives 


m Mars = a 3 /P 2 = 0.000000322 M e 
(«i 0.107 Mq). 

Example 6.5 Derive formulas for a planet’s he¬ 
liocentric longitude and latitude, given its orbital 
elements and true anomaly. 

We apply the sine formula to the spherical tri¬ 
angle of the figure: 

sin /) sin(ftj + /) 
sin/ sin(jr/2) 

or 

sin/3 = sin / sin(n>+ /). 



= 6.5044 au/a ~ 31 km/s. 


The sine-cosine formula gives 
cos(7r/2) sin /3 

= — cos / sin(&> + /) cos (A — C?) 
+ cos(® + /) sin(A. — ^2), 


Example 6.3 In an otherwise empty universe, w hence 

two rocks of 5 kg each orbit each other at a dis¬ 
tance of 1 m. What is the orbital period? 


tan(/\. — £2) — cos / tan(w + /). 
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Example 6.6 Find the radius vector and helio¬ 
centric longitude and latitude of Jupiter on Au¬ 
gust 23, 1996. 

The orbital elements were computed in Exam¬ 
ple 6.1: 

a — 5.2033 au, 
e = 0.0484, 
i = 1.3053°, 

Q = 100.5448°, 
co = 274.2012°, 

M = 277.7940° = 4.8484 rad. 

Since the mean anomaly was obtained directly, 
we need not compute the time elapsed since peri¬ 
helion. 

Now we have to solve Kepler’s equation. It 
cannot be solved analytically, and we are obliged 
to take the brute force approach (also called nu¬ 
merical analysis) in the form of iteration. For it¬ 
eration, we write the equation as 

E n+ 1 = M + esin E„, 

where E n is the value found in the nth iteration. 
The mean anomaly is a reasonable initial guess, 
Eq. (N.B.: Here, all angles must be in radians; 
otherwise, nonsense results!) The iteration pro¬ 
ceeds as follows: 

E 0 = M = 4.8484, 

Ei = M + e sin E 0 = 4.8004, 

E 2 = M +esin £i =4.8002, 


E 3 = M + esinE 2 = 4.8002, 

after which successive approximations no longer 
change, which means that the solution, accurate 
to four decimal places, is 

E — 4.8002 = 275.0°. 

The radius vector is 

r — a (cos E — e)i + a\! 1 — e 2 sin E j 
= 0.2045/-5.1772 j 

and the distance from the Sun, 

/■ = a(l — ecos E) — 5.1813 au. 

The signs of the components of the radius vector 
show that the planet is in the fourth quadrant. The 
true anomaly is 

-5.1772 

f — arctan-= 272.3 . 

J 0.2045 

Applying the results of the previous example, we 
find the latitude and longitude: 

sin ji = sin i sin(&> + /) 

= sin 1.3° sin(274.2° + 272.3°) 

= -0.0026 
=>• P = —0.15°, 
tan(k — J2) = cos i tan(<w + /) 

= cos 1.3° tan(274.2° + 272.3°) 

= 0.1139 
=>• k = J2 +186.5° 

= 100.5° + 186.5° 

= 287.0°. 

(We must be careful here; the equation for tan (a — 
Q) allows two solutions. If necessary, a figure 
can be drawn to decide which is the correct one.) 

Example 6.7 Find Jupiter’s right ascension and 
declination on August 23, 1996. 
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In Example 6.6, we found the longitude and 
latitude, X = 287.0°, /3 = —0.15°. The corre¬ 
sponding rectangular (heliocentric) coordinates 
are: 

x = r cos X cos — 1.5154 au, 
y = r sin X cos /3 = —4.9547 au, 
z = r sin — —0.0133 au. 

Jupiter’s ecliptic coordinates must be trans¬ 
formed to equatorial ones by rotating them around 
the x-axis by an angle s, the obliquity of the 
ecliptic (see Box 2.1): 

X] = x — 1.5154 au, 

Y j = y cos s — z sin e = —4.5405 au, 

Zj = y sin s + z cos s — — 1.9831 au. 

To find the direction relative to the Earth, we 
have to find where the Earth is. In principle, we 
could repeat the previous procedure with the or¬ 
bital elements of the Earth. Or, if we are lazy, 
we could pick up the nearest Astronomical Al¬ 
manac, which lists the equatorial coordinates of 
the Earth: 

X® = 0.8815 au, 

F© = —0.4543 au, 

Z© =-0.1970 au. 

Then the position relative to the Earth is 

X 0 = Xj - X© = 0.6339 au, 

Y 0 = Tj - F© = -4.0862 au, 

Zo = Zj — Z© = —1.7861 au. 

And finally, the right ascension and declination 
are 

a — arctan(Fo/Xo) = 278.82° = 18 h 35 min, 

S = arctan - = —23.4°. 

Pi+n 

If the values given by the Astronomical Almanac 
are rounded to the same accuracy, the same result 


is obtained. We should not expect a very precise 
position since we have neglected all short-period 
perturbations in Jupiter's orbital elements. 

Example 6.8 Which is easier, to send a probe to 
the Sun or away from the Solar system? 

The orbital velocity of the Earth is about 
30 km/s. Thus the escape velocity from the So¬ 
lar system is ~J2 x 30 ^ 42 km/s. A probe that is 
sent from the Earth already has a velocity equal 
to the orbital velocity of the Earth. Hence an extra 
velocity of only 12 km/s is needed. In addition, 
the probe has to escape from the Earth, which re¬ 
quires 11 km/s. Thus the total velocity changes 
are about 23 km/s. 

If the probe has to fall to the Sun it has to get 
rid of the orbital velocity of the Earth 30 km/s. In 
this case, too, the probe has first to be lifted from 
the Earth. Thus the total velocity change needed 
is 41 km/s. This is nearly impossible with current 
technology. Therefore a probe to be sent to the 
Sun is first directed close to some planet, and the 
gravitational field of the planet is used to acceler¬ 
ate the probe towards its final destination. 

Example 6.9 An interstellar hydrogen cloud 
contains 10 atoms per cm 3 . How big must the 
cloud be to collapse due to its own gravitation? 
The temperature of the cloud is 100 K. 

The mass of one hydrogen atom is 1.67 x 
10~ 27 kg, which gives a density 

p = n/i = 10 7 m“ 3 x 1.67 x 10 -27 kg 
= 1.67 x 10 -20 kg/m 3 . 

The critical mass is 
Mj 

1.38 x 10 -23 J/Kx 100 K \ 3/2 

1.67 x 10- 27 kg x 6.67 x ltT 11 Nm 2 kg“ 2 / 

1 

< , 

-y/l-67 x 10 -20 kg/m 3 
« 1 x 10 34 kg^5000M o . 

The radius of the cloud is 


, 3 M , 7 

R = l -«5x 10 17 mss 20 pc. 

V 4tt p 
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6.13 Exercises 

Exercise 6.1 Find the ratio of the orbital veloci¬ 
ties at aphelion and perihelion t> a /t> p . What is this 
ratio for the Earth? 

Exercise 6.2 The perihelion and aphelion of the 
orbit of Eros are 1.1084 and 1.8078 astronomical 
units from the Sun. What is the velocity of Eros 
when its distance from the Sun equals the mean 
distance of Mars? 

Exercise 6.3 Find the radius of the orbit of 
a geostationary satellite; such a satellite remains 
always over the same point of the equator of the 
Earth. Are there areas on the surface of the Earth 
that cannot be seen from any geostationary satel¬ 
lite? If so, what fraction of the total surface area? 

Exercise 6.4 From the angular diameter of the 
Sun and the length of the year, derive the mean 
density of the Sun. 


Exercise 6.5 Find the mean, eccentric and true 
anomalies of the Earth one quarter of a year after 
the perihelion. 

Exercise 6.6 The velocity of a comet is 5 m/s, 
when it is very far from the Sun. If it moved along 
a straight line, it would pass the Sun at a distance 
of 1 au. Find the eccentricity, semimajor axis and 
perihelion distance of the orbit. What will happen 
to the comet? 

Exercise 6.7 (a) Find the ecliptic geocentric ra¬ 
dius vector of the Sun on May 1, 1997 (J — 
2,450,570). 

(b) What are the declination and right ascen¬ 
sion of the Sun then? 
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The solar system consists of a central star, called 
the Sun , planets orbiting the Sun and several 
smaller objects. By the solar system we mean 
here and in the next chapter the system around 
our own Sun. Also other stars have similar sys¬ 
tems. They will be discussed later in Chap. 22. 
This chapter deals with general properties of the 
solar system. Individual objects will be discussed 
in the next chapter. 

Research of the solar system has evolved 
rapidly since the 1960’s when space probes have 
made it possible to study planets from short dis¬ 
tances. Many methods used in geosciences are 
nowadays applied also in planetary studies. Lan¬ 
ders have been sent to the Moon, Venus, Mars, 
and several smaller bodies. 

The most convenient way to describe distances 
in the solar system is to use astronomical units 
(au), the mean distance of the Sun and Earth. One 
au is = 1.495 97870 x 10 11 m(see Sect. 6.5). The 
distance to the nearest star, Proximo Centauri is 
over 270,000 au. 

Actually one au is the semimajor axis of a 
massless planet whose orbital period equals that 
of the Earth. Since also the mass of the Earth af¬ 
fects its motion, the semimajor axis of the Earth 
is slightly larger than one au (6.32). The small 
difference is, however, important only in precise 
calculations. 


7.1 Classification of Objects 

In addition to the Sun and Moon five objects mov¬ 
ing relative to the stars were already known in 


the antiquity: Mercury, Venus, Mars, Jupiter and 
Saturn. They were called planets from the Greek 
word meaning a wanderer. At that time also the 
Sun and Moon were considered planets, and the 
names of these seven objects are still reflected in 
the names of the days of the week. 

After the invention of the telescope three more 
planets were found: Uranus, Neptune and Pluto. 
When observational instruments and methods 
have evolved, more objects orbiting behind Nep¬ 
tune and of the same size as Pluto have been 
found. Since there was no unambiguous defini¬ 
tion of a planet, some of these new objects should 
have been called planets. Therefore the Interna¬ 
tional Astronomical Union (IAU) in its General 
Assembly in 2006 defined three distinct cate¬ 
gories to clarify the situation. 

According to the new definition, an object is 
a planet if it satisfies the following three condi¬ 
tions: 

(1) It orbits the Sun. 

(2) It has sufficient mass for its self-gravity 
to overcome rigid body forces so that it 
assumes a hydrostatic equilibrium (nearly 
round) shape. 

(3) Its perturbations have cleared away other ob¬ 
jects in the neighbourhood of its orbit. 

If a body satisfies the conditions (1) and (2) but 
not (3), it is a dwarf planet. Since Pluto does not 
fulfil the last requirement its status was reduced 
to a dwarf planet. 

All other objects orbiting the Sun shall be re¬ 
ferred to collectively as Small Solar System Bod¬ 
ies. These include most of the asteroids, Trans- 
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Neptunian Objects, comets, and other small bod¬ 
ies. If the central body is not the Sun the object is 
a moon or a satellite independently of its proper¬ 
ties. 

A satellite is a body which orbits the primary 
body so that the centre of mass (barycentre) is 
inside the primary. If this is not the case, then the 
system is called a binary system. For example, in 
the case of the Earth and Moon the barycentre of 
the system is inside the Earth, and the Moon is 
Earth’s satellite. In the Pluto-Charon system the 
centre of mass is outside Pluto, and therefore they 
are called a binary system. 

The rules (2) and (3) are somewhat problem¬ 
atic. How can the shape of a very distant object 
be established? How wide an area a planet should 
clean to be a planet, and how clean that region 
should be? 

Also the definition of a satellite is slightly 
problematic, since the largest planets are sur¬ 
rounded by ring systems consisting of small par¬ 
ticles. What is the distinction between the largest 
ring particles and smallest satellites? In practice, 
an object could be considered a moon if it has 
been observed as a separate body whose orbital 
elements have been determined. In any case the 
question about the exact number of satellites is 
no more meaningful, since more and more will be 
found when observations become more accurate. 
More interesting questions are e.g. the statistical 
properties of the moons. 

Other unresolved questions concern the classi¬ 
fication of the minor bodies and exoplanets. Pos¬ 
sibly the definitions need some fine-tuning in the 
future. 

According the current definition there are now 
eight planets in the solar system: Mercury, Venus, 
Earth, Mars, Jupiter, Saturn, Uranus, and Nep¬ 
tune. It looks quite improbable that more planets 
will be found. 

Currently there are five known dwarf planets: 
Pluto that was earlier a planet, the first asteroid 
Ceres and Haumea, Makemake and Eris found 
in 2004-2005 orbiting beyond Pluto. Similar ob¬ 
jects may well be found in the future. The prob¬ 
lem is to check whether these distant bodies sat¬ 
isfy the conditions (2) and (3). 

The solar system contains a vast number 
of different small bodies. Traditionally, they 


have been divided into three categories: aster¬ 
oids, comets and meteoroids. Their differences 
and properties are discussed in more detail in 
Sects. 8.10-8.14. 

The planets from Mercury to Saturn are bright 
and well visible with a naked eye. Records of 
them are found even in the most ancient writ¬ 
ten documents. Uranus and Neptune can be seen 
with a pair of binoculars. In addition to the bright 
planets, only the brightest comets are visible with 
a naked eye. 

Gravitation controls the motion of the solar 
system bodies. The planetary orbits around the 
Sun (Fig. 7.1) are almost coplanar ellipses which 
deviate only slightly from circles. Mercury, the 
innermost planet, has the most eccentric orbit. 
The orbital planes of asteroids, minor bodies 
that circle the Sun mainly between the orbits of 
Mars and Jupiter, are often more tilted than the 
planes of the planetary orbits. Asteroids and dis¬ 
tant Trans-Neptunian Objects revolve in the same 
direction as the major planets; comets, however, 
may move in the opposite direction. Cometary or¬ 
bits can be very elongated, even hyperbolic. Most 
of the satellites circle their parent planets in the 
same direction as the planet moves around the 
Sun. Only the motions of the smallest particles, 
gas and dust are affected by the solar wind, radi¬ 
ation pressure and magnetic fields. 

The planets can be divided into inferior and 
superior planets depending on their orbits. Mer¬ 
cury and Venus are inferior planets as seen from 
the Earth, and planets from Mars to Neptune are 
superior planets. 

Depending on their physical properties the 
planets can be divided into different groups 
(Fig. 7.2). Mercury, Venus, Earth, and Mars are 
called terrestrial (Earth-like) planets; they have 
a solid surface, are of almost equal size (diam¬ 
eters from 5000 to 12,000 km), and have quite 
a high mean density (4000-5000 kgm -3 ; the 
density of water is 1000 kgm -3 ). The plan¬ 
ets from Jupiter to Neptune are called Jovian 
(Jupiter-like) or giant planets. The densities of 
the giant planets are about 1000-2000 kgm -3 , 
and most of their volume is liquid. Diameters are 
several times greater than those of the terrestrial 
planets. 
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Fig. 7.1 (a) Planetary orbits from Mercury to Mars. The 
dashed line represents the part of the orbit below the eclip¬ 
tic; the arrows show the distances travelled by the planets 
during one month (January 2000). (b) Planets from Jupiter 


to Neptune and the dwarf planet Pluto. The arrows indi¬ 
cate the distances travelled by the planets during the 10 
year interval 2000-2010 



Fig. 7.2 Major planets from Mercury to Neptune. Four 
innermost planets are called terrestrial planets and four 
outermost ones are giant planets. Three dwarf planets are 


7.2 Planetary Configurations 

To the naked eye planets look starlike dots. How¬ 
ever, their slow motion with respect to the stars 
reveals that they are bodies of our solar system. 


also shown. Relative size of the Sun is shown at left. Plan¬ 
etary distances to the Sun are not in scale. (The Interna¬ 
tional Astronomical Union/Martin Kornmesser) 


The apparent motions of the planets look quite 
complicated, partly because they reflect the mo¬ 
tion of the Earth around the Sun (Figs. 7.3 and 
7.4). 
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Fig. 7.3 (a) Apparent motion of Mars during the 2016 
opposition. Usually Mars moves in the prograde direction 
(counterclockwise with respect to the stars), but over a 
month before and after the opposition the motion is retro- 

Normally the planets move eastward (direct 
motion , counterclockwise as seen from the North¬ 
ern hemisphere) when compared with the stars. 
When the Earth passes a superior planet, the mo¬ 
tion of the planet reverses to the opposite or ret¬ 
rograde direction. After a few weeks of retro¬ 
grade motion, the direction is changed again, and 
the planet continues in the original direction. It is 
quite understandable that the ancient astronomers 
had great difficulties in explaining and modelling 
such complicated turns and loops. Figure 7.5 ex¬ 
plains some basic planetary configurations. 

A superior planet (planet outside the orbit of 
the Earth) is said to be in opposition when it is 
exactly opposite the Sun, i.e. when the Earth is 
between the planet and the Sun. When the planet 
is behind the Sun, it is in conjunction. In prac¬ 


grade. (b) Relative positions of the Earth and Mars. The 
projection of the Earth-Mars direction on the infinitely 
distant celestial sphere results in (a) 


tise, the planet may not be exactly opposite or 
behind the Sun because the orbits of the planet 
and the Earth are not in the same plane. In astro¬ 
nomical almanacs oppositions and conjunctions 
are defined in terms of ecliptic longitudes. The 
longitudes of a body and the Sun differ by 180° at 
the moment of opposition; in conjunction the lon¬ 
gitudes are equal. However, the right ascension is 
used if the other body is not the Sun. Those points 
where the apparent motion of a planet changes its 
direction are called stationary points. Opposition 
occurs in the middle of the retrograde loop. 

Inferior planets (Mercury and Venus) are never 
in opposition. The configuration when the plan¬ 
ets is between the Earth and the Sun is called in¬ 
ferior conjunction. The conjunction correspond¬ 
ing to that of a superior planet is called an upper 
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Fig. 7.4 Apparent motions of the Sun, Mercury, Venus 
and Mars in 1995 in the geocentric frame as seen from the 
North pole of the ecliptic 


side of the Sun the planet is seen. The planet is 
an “evening star” and sets after the Sun when it is 
in eastern elongation; in western elongation the 
planet is seen in the morning sky as a “morning 
star”. 

The synodic period is the time interval be¬ 
tween two successive events (e.g. oppositions). 
The period which we used in the previous chap¬ 
ters is the sidereal period , the true time of revolu¬ 
tion around the Sun, unique for each object. The 
synodic period depends on the difference of the 
sidereal periods of the two bodies. 

Let the sidereal periods of two planets be P\ 
and Pi (assume that P\ < Pi). Their mean an¬ 
gular velocities (mean motions) are 2jt/Pi and 
2tx / Pi . After one synodic period Pi, 2 , the inner 
planet has made one full revolution more than 
the outer planet. Thus the angles travelled by the 
planets are related by 
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Fig. 7.5 Planetary configurations. The angle a (Sun-ob- 
ject-Earth) is the phase angle and £ (Sun-Earth-object) is 
the elongation 
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(7.1) 


Since Mercury and Venus move inside the or¬ 
bit of the Earth, they show similar phases as the 
Moon. The angle Sun-planet-Earth is called the 
phase angle, often denoted by the Greek letter a. 
The fraction of the planetary surface seen illumi¬ 
nated depends on the phase angle. For Mercury 
and Venus the phase angle can have any value be¬ 
tween 0° and 180°. This means that we can see 
“full Venus” (when it is behind the Sun), “half 
Venus”, and so on. 

The phase angle range of the superior planets 
is more limited. For Mars the maximum phase 
angle is 41°, for Jupiter 11°, and for Neptune 
only 2°. 


conjunction or superior conjunction. The maxi¬ 
mum (eastern or western) elongation, i.e. the an¬ 
gular distance of the planet from the Sun is 28° 
for Mercury and 47° for Venus. Elongations are 
called eastern or western, depending on which 


7.3 Orbit of the Earth and Visibility 
of the Sun 

The rotation of the Earth and its motion around 
the Sun have been used as the basis of time reck¬ 
oning since prehistoric times (although their true 
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nature was not known then). However, these mo¬ 
tions are not free from perturbations. Although 
our modern time system is based on atomic 
clocks, the ordinary time we use in our clocks 
is adjusted to the rotation of the Earth by leap 
seconds (Sect. 2.14). 

The sidereal year is the real orbital period 
of the Earth around the Sun. After one sidereal 
year, the Sun is seen at the same position rela¬ 
tive to the stars. The length of the sidereal year is 
365.256363051 days of 86,400 SI seconds at the 
epoch J2000.0 = 2000 January 1 12:00:00 TT. 

We noted earlier that, owing to precession, the 
direction of the vernal equinox moves along the 
ecliptic at about 50" per year. This means that the 
Sun returns to the vernal equinox before one com¬ 
plete sidereal year has elapsed. This time interval, 
called the tropical year, is 365.24218967 days. 

A third definition of the year is based on the 
perihelion passages of the Earth. Planetary per¬ 
turbations cause a gradual change in the direc¬ 
tion of the Earth’s perihelion. The time inter¬ 
val between two successie perihelion passages is 
called the anomalistic year, the length of which 
is 365.259635864 days, a little longer than the 
sidereal year. It takes about 21,000 years for the 
perihelion to revolve 360° relative to the vernal 
equinox. 

The equator of the Earth is tilted about 23.4° 
with respect to the ecliptic. Owing to perturba¬ 
tions, this angle changes with time. If periodic 
terms are neglected, the obliquity of the ecliptic e 
can be calculated as in Box 2.1. The obliquity 
varies between 22.1° and 24.5° with a 41,000 
year periodicity. At present the tilt is decreasing. 
There are also small short term variations, like the 
nutation. 

The declination of the Sun varies between — e 
and +£ during the year. At any given time, the 
Sun is seen at zenith from one point on the surface 
of the Earth. The latitude of this point is the same 
as the declination of the Sun. At the latitudes — s 
(the Tropic of Capricorn ) and +e (the Tropic of 
Cancer), the Sun is seen at zenith once every 
year, and between these latitudes twice a year. 
The Sun crosses the equator at vernal and autum¬ 
nal equinox. 

In the Northern hemisphere the Sun will not 
set if the latitude is greater than 90° — S, where 


8 is the declination of the Sun. The southern¬ 
most latitude where the midnight Sun can be seen 
is thus 90° — e — 66.6°. This is called the Arc¬ 
tic Circle. (The same holds true in the Southern 
hemisphere.) The Arctic Circle is the southern¬ 
most place where the Sun is (in theory) below 
the horizon during the whole day at the winter 
solstice. The sunless time lasts longer and longer 
when one goes north (south in the Southern hemi¬ 
sphere). At the poles, day and night last half 
a year each. In practise, refraction and location of 
the observing site will have a large influence on 
the visibility of the midnight Sun and the number 
of sunless days. Because refraction raises objects 
seen at the horizon, the midnight Sun can be seen 
a little further south than at the Arctic Circle. For 
the same reason the Sun can be seen simultane¬ 
ously at both poles around the time of vernal and 
autumnal equinox. (See also Sect. 2.6.) 

The eccentricity of the Earth’s orbit is about 
0.0167. The distance from the Sun varies between 
147-152 million km. The flux density of solar ra¬ 
diation varies somewhat at different parts of the 
Earth’s orbit, but this has practically no effect on 
the seasons. In fact the Earth is at perihelion in the 
beginning of January, in the middle of the north¬ 
ern hemisphere’s winter. 

The seasons are due to the obliquity of the 
ecliptic which affects the energy received from 
the Sun depends in three different ways. First the 
flux per unit area is proportional to sin a, where 
a is the altitude of the Sun. In summer the alti¬ 
tude can have greater values than in winter, giv¬ 
ing more energy per unit area. Another effect is 
due to the atmosphere: when the Sun is near the 
horizon, the radiation must penetrate thick atmo¬ 
spheric layers. This means large extinction and 
less radiation at the surface. The third factor is the 
length of the time the Sun is above the horizon. 
This is important at high latitudes, where the low 
altitude of the Sun is compensated by the long 
daylight time in summer. This last effect is ne¬ 
glected in many textbooks written by authors not 
living this far north. These effects are discussed 
in detail in Example 7.2. 

There are also long-term variations in the an¬ 
nual Solar flux. Serbian geophysicist Milutin Mi- 
lankovic (1879-1958) published in the 1930’s 
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and 1940’s his theory of ice ages. During the last 
2-3 million years, large ice ages have recurred 
approximately every 100,000 years. He proposed 
that variations of the Earth’s orbit cause long¬ 
term periodic climate change, now known as Mi- 
lankovic cycles. Milankovic claimed that the cy¬ 
cles in eccentricity, direction of the perihelion, 
obliquity, and precession result in 100,000 year 
ice age cycle. The cycle of precession is 26,000 
years, direction of the perihelion relative to the 
equinoxes is 22,000 years, and the obliquity of 
the ecliptic has a 41,000 year cycle. Changes 
in orbital eccentricity are not fully periodic but 
some periods above 100,000 years can be found. 
The eccentricity varies between 0.005-0.058 and 
is currently 0.0167. 

The annual incoming Solar flux varies with 
these orbital changes and the effect is largest at 
high latitudes. If, for example, the eccentricity is 
high, and the Earth is near the aphelion during the 
hemisphere’s winter, then winters are long and 
cold and summers are short. However, the the¬ 
ory is controversial, orbital forcing on the climate 
change is not well understood, and probably not 
enough to trigger glaciation. There exist also pos¬ 
itive feedback loops, like the effect of low albedo 
of snow and ice. It means that ice reflects more ra¬ 
diation back into space, thus cooling the climate. 
The system is highly chaotic so that even minor 
changes in the primary conditions will result in 
large differences in the outcome. There are also 
other effects causing climate change, including 
emerging gases from large lava flows and erup¬ 
tions of volcanos and, nowadays, anthropogenic 
reasons. 

The future is also uncertain. Some theories 
predict that the warm period will continue next 
50,000 years, whereas others conclude that the 
climate is already cooling. Anthropogenic rea¬ 
sons, like ever increasing fraction of green house 
gases, e.g. carbon dioxide, will change the short¬ 
term predictions. 


7.4 The Orbit of the Moon 

The Earth’s satellite, the Moon, circles the Earth 
counterclockwise. One revolution, the sidereal 


month, takes about 27.322 days. In practise, 
a more important period is the synodic month, 
the duration of the Lunar phases (e.g. from full 
moon to full moon). In the course of one sidereal 
month the Earth has travelled almost 1/12 of its 
orbit around the Sun. The Moon still has about 
1/12 of its orbit to go before the Earth-Moon- 
Sun configuration is again the same. This takes 
about 2 days, so the phases of the Moon are re¬ 
peated every 29 days. More exactly, the length of 
the synodic month is 29.531 days. 

The new moon is that instant when the Moon 
is in conjunction with the Sun. Almanacs define 
the phases of the Moon in terms of ecliptic lon¬ 
gitudes; the longitudes of the new moon and the 
Sun are equal. Usually the new moon is slightly 
north or south of the Sun because the lunar orbit 
is tilted 5° with respect to the ecliptic. 

About 2 days after the new moon, the wax¬ 
ing crescent moon can be seen in the western 
evening sky. About 1 week after the new moon, 
the first quarter follows, when the longitudes of 
the Moon and the Sun differ by 90°. The right half 
of the Moon is seen lit (left half when seen from 
the Southern hemisphere). Th e full moon appears 
a fortnight after the new moon, and 1 week af¬ 
ter this the last quarter. Finally the waning cres¬ 
cent moon disappears in the glory of the morning 
sky. 

The orbit of the Moon is approximately ellip¬ 
tic. The length of the semimajor axis is 
384,400 km and the eccentricity 0.055. Owing 
to perturbations caused mainly by the Sun, the 
orbital elements vary with time. The minimum 
distance of the Moon from the centre of the 
Earth is 356,400 km, and the maximum distance 
406,700 km. This range is larger than the one cal¬ 
culated from the semimajor axis and the eccen¬ 
tricity. The apparent angular diameter is in the 
range 29.4'-33.5'. 

The rotation time of the Moon is equal to the 
sidereal month, so the same side of the Moon al¬ 
ways faces the Earth. Such synchronous rotation 
is common among the satellites of the solar sys¬ 
tem: almost all large moons rotate synchronously. 

The orbital speed of the Moon varies accord¬ 
ing to Kepler’s second law. The rotation period, 
however, remains constant. This means that, at 




148 


7 The Solar System 



Fig. 7.6 Librations of the Moon can be seen in this pair of photographs taken when the Moon was close to the perigee 
and the apogee, respectively. (Helsinki University Observatory) 


different phases of the lunar orbit, we can see 
slightly different parts of the surface. When the 
Moon is close to its perigee, its speed is greater 
than average (and thus greater than the mean ro¬ 
tation rate), and we can see more of the right-hand 
edge of the Moon’s limb (as seen from the North¬ 
ern hemisphere). Correspondingly, at the apogee 
we see “behind” the left edge. Owing to this li- 
bration, a total of 59 % of the surface area can 
be seen from the Earth (Fig. 7.6). The libration is 
quite easy to see if one follows some detail at the 
edge of the lunar limb. 

There are also two other factors causing libra¬ 
tion. When the Moon is rising from the East, we 
can see a little behind the right edge, and when 
the Moon is setting, behind the left edge. The 
third effect is the latitudinal libration. Since the 
orbit of the Moon is not quite in the equatorial 
plane, we can see behind the northern or southern 
pole depending on the position of the Moon. 

The orbital plane of the Moon is tilted only 
about 5° to the ecliptic. Therefore the Moon is 
always close to the ecliptic just like the Sun and 


the planets. However, the orbital plane changes 
gradually with time, owing mainly to the pertur¬ 
bations caused by the Earth and the Sun. These 
perturbations cause the nodal line (the intersec¬ 
tion of the plane of the ecliptic and the orbital 
plane of the Moon) to make one full revolu¬ 
tion in 18.6 years. We have already encountered 
the same period in the nutation. When the as¬ 
cending node of the lunar orbit is close to the 
vernal equinox, the Moon can be 23.4° + 5° = 
28.4° north or south of the equator. When the de¬ 
scending node is close to the vernal equinox, the 
zone where the Moon can be found extends only 
23.4° — 5° = 18.4° north or south of the equator. 

The nodical or draconic month is the time in 
which the Moon moves from one ascending node 
back to the next one. Because the line of nodes is 
rotating, the nodical month is 3 hours shorter than 
the sidereal month, i.e. 27.212 days. The orbital 
ellipse itself also precesses slowly. The orbital 
period from perigee to perigee, the anomalistic 
month, is 5.5 h longer than the sidereal month, or 
about 27.555 days. 
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Gravitational differences caused by the Moon 
and the Sun on different parts of the Earth’s sur¬ 
face give rise to the tides. Gravitation is greatest 
at the sub-lunar point and smallest at the opposite 
side of the Earth. At these points, the surface of 
the seas is highest (high tide, flood). About 6 h 
after flood, the surface is lowest (low tide, ebb). 
The tide generated by the Sun is less than half of 
the lunar tide. When the Sun and the Moon are 
in the same direction with respect to the Earth 
(new moon) or opposite each other (full moon), 
the tidal effect reaches its maximum; this is called 
spring tide. 

The sea level typically varies 1 m, but in some 
narrow straits, the difference can be as great as 
15 m. Due to the irregular shape of the oceans, 
the true pattern of the oceanic tide is very com¬ 
plicated. 

The solid surface of the Earth also suffers tidal 
effects, but the amplitude is much smaller, about 
30 cm. 

Tides generate friction, which dissipates the 
rotational and orbital kinetic energy of the Earth- 
Moon system. This energy loss induces some 
changes in the system. First, the rotation of the 
Earth slows down until the Earth also rotates syn¬ 
chronously, i.e. the same side of Earth will always 
face the Moon. Secondly, the semimajor axis of 
the orbit of the Moon increases, and the Moon 
drifts away about 3 cm per year. 


7.5 Eclipses and Occultations 

An eclipse is an event in which a body goes 
through the shadow of another body. The most 
frequently observed eclipses are the lunar eclipses 
and the eclipses of the large satellites of Jupiter. 
An occultation takes place when an occulting 
body goes in front of another object; typical 
examples are stellar occultations caused by the 
Moon. Generally, occultations can be seen only 
in a narrow strip; an eclipse is visible wherever 
the body is above the horizon. 

Solar and lunar eclipses are the most spec¬ 
tacular events in the sky. A solar eclipse occurs 
when the Moon is between the Earth and the Sun 
(Fig. 7.7). (According to the definition, a solar 
eclipse is not an eclipse but an occultation!) If the 
whole disk of the Sun is behind the Moon, the 


a) Total solar eclipse 



c) Lunar eclipse 



Fig. 7.7 (a) A total solar eclipse can be seen only inside 
a narrow strip; outside the zone of totality the eclipse is 
partial, (b) An eclipse is annular if the Moon is at apogee 
from where the shadow of the Moon does not reach the 
Earth, (c) A lunar eclipse is visible everywhere where the 
Moon is above the horizon 



Fig. 7.8 The total eclipse of the Sun occurred in 1990 
over Finland. (Photo Math Martikainen) 


eclipse is total (Fig. 7.8); otherwise, it is partial. 
If the Moon is close to its apogee, the apparent 
diameter of the Moon is smaller than that of the 
Sun, and the eclipse is annular. 

If the orbital plane of the Moon coincided with 
the plane of the ecliptic, one solar and one lunar 
eclipse would occur every synodic month. How¬ 
ever, the plane is tilted about 5°; therefore, at full 
moon, the Moon must be close to the nodes for 
an eclipse to occur. The angular distance of the 
Moon from the node must be smaller than 4.6° 
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for a total lunar eclipse, and 10.3° for a total so¬ 
lar eclipse. 

Two to seven eclipses occur annually. Usually 
eclipses take place in a set of 1-3 eclipses, sepa¬ 
rated by an interval of 173 days. In one set there 
can be just one solar eclipse or a succession of 
solar, lunar and another solar eclipse. In one year, 
eclipses belonging to 2 or 3 such sets can take 
place. 

The Sun and the (ascending or descending) 
node of the lunar orbit are in the same direction 
once every 346.62 days. Nineteen such periods 
(= 6585.78 days =18 years 11 days) are very 
close to the length of 223 synodic months. This 
means that the Sun-Moon configuration and the 
eclipses are repeated in the same order after this 
period, called for historical reasons the Saws pe¬ 
riod; it was already known to the ancient Babylo¬ 
nians. 

A solar eclipse is total if the whole solar disk 
is hidden behind the Moon. In a partial eclipse 
only a part of the Sun is covered. If the Moon is 
close to its apogee its apparent diameter is smaller 
than the diameter of the Sun, and the eclipse is 
annular. 

During a solar eclipse the shadow of the Moon 
on Earth’s surface is always less than 270 km 
wide. The shadow moves at least 34 km/min; thus 
the maximum duration of a total eclipse is 
minutes. On both sides of the track of the total 
eclipse a partial eclipse can be seen. 

Also lunar eclipses can be of different kinds. 
A lunar eclipse is total if the Moon is entirely in¬ 
side the umbral shadow of the Earth; otherwise 
the eclipse is partial. If the Moon does not hit 
the umbral shadow but receives some sunlight, 
the eclipse is difficult to see with the unaided eye 
because the lunar magnitude remains almost un¬ 
changed. 

A lunar eclipse can be seen on the whole hemi¬ 
sphere where the Moon is above the horizon. The 
maximum duration of a lunar eclipse is 3.8 h, and 
the duration of the total phase is always shorter 
than 1.7 h. During the total phase the Moon is 
coloured deep red because some red light is re¬ 
fracted through the Earth’s atmosphere. 

The Moon moves eastwards, and stars are oc¬ 
culted by the dark edge of the Moon during 
the first quarter. Therefore occultation is easier 



Fig. 7.9 Transit of Venus was seen in June 6, 2004. Venus 
is seen as a small dark disc in front of the Sun. (Photo H. 
Karttunen) 


to observe, and photometric measurements are 
possible Observations of the stellar occultations 
caused by the Moon formerly served as an accu¬ 
rate method for determining the lunar orbit. Be¬ 
cause the Moon has no atmosphere, a star disap¬ 
pears abruptly in less than 1 /50 s. If a fast pho¬ 
tometer is used for recording the event, the typical 
diffraction pattern can be seen. The shape of the 
diffraction was used to determine angular diame¬ 
ters of stars ans separations of binary stars. In the 
first decades of radio astronomy the occultations 
of some radio sources were used for determining 
their exact positions. 

There are some bright stars and planets inside 
the 11° wide zone where the Moon moves, but the 
occultation of a bright, naked-eye object is quite 
rare. 

Occultations are also caused by planets and as¬ 
teroids. Accurate predictions are complicated be¬ 
cause such an event is visible only in a very nar¬ 
row path. The Uranian rings were found during 
an occultation in 1977, and the shapes of some as¬ 
teroids have been studied during some favourable 
events, timed exactly by several observers located 
along the predicted path. 

A transit is an event in which Mercury or 
Venus moves across the Solar disk as seen from 
the Earth (Fig. 7.9). A transit can occur only when 
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the planet is close to its orbital node at the time 
of inferior conjunction. Transits of Mercury oc¬ 
cur about 13 times per century; transits of Venus 
only twice. The next transits of Mercury are; 
Nov 11, 2019; Nov 13, 2032 and Nov 7, 2039. 
The next transits of Venus are: Dec 11, 2117; 
Dec 8, 2125 and Jun 11, 2247. In the 18th cen¬ 
tury the two transits of Venus (1761 and 1769) 
were used for determining the value of the astro¬ 
nomical unit. 


7.6 The Structure and Surfaces of 
Planets 

Since the 1960’s a vast amount of data have been 
collected using spacecraft, either during a flyby, 
orbiting a body, or directly landing on the surface. 
This gives a great advantage compared to other 
astronomical observations. We may even speak of 
revolution: the solar system bodies have turned 
from astronomical objects to geophysical ones. 
Many methods traditionally used in various sib¬ 
ling branches of geophysics can now be applied 
to planetary studies. 

The perturbations in the orbit of a satellite or 
spacecraft can be used in studying the internal 
structure of a planet. Any deviation from spher¬ 
ical symmetry is visible in the external gravita¬ 
tional field. The shape and irregularities of the 
gravitation field generated by a planet reflect its 
shape, internal structure and mass distribution. 
Also the surface gives certain indications on in¬ 
ternal structure and processes. 

The IAU planet definition states that planets 
are bodies in hydrostatic equilibrium. Gravity of 
a body will pull its material inwards, but the body 
resist the pull if the strength of the material is 
greater than the pressure exerted by the overly¬ 
ing layers. If the diameter is larger than about 
800-1000 km, gravity is able to deform rocky 
bodies into spherical shape. Smaller bodies than 
this have irregular shapes. On the other hand, e.g. 
icy moons of Saturn are spherical because ice is 
more easily deformed than rock. 

Hydrostatic equilibrium means that the sur¬ 
face of the body approximately follows an equi- 
potential surface of gravity. This is true e.g. on the 
Earth, where the sea surface very closely follows 


the equipotential surface called the geoid. Due to 
internal strength of rocks, continents can deviate 
from the geoid surface by a few kilometers but 
compared to the diameter of the Earth the surface 
topography is negligible. 

A rotating planet is always flattened. The 
amount of flattening depends on the rotation rate 
and the strength of the material; a liquid drop is 
more easily deformed than a rock. The shape of 
a rotating body in hydrostatic equilibrium can be 
derived from the equations of motion. If the ro¬ 
tation rate is moderate, the equilibrium shape of 
a liquid body is an ellipsoid of revolution, rotat¬ 
ing around its shortest axis. 

If R e and R p are the equatorial and polar radii, 
respectively, the shape of the planet can be ex¬ 
pressed as 


1 1 2 

x~ y~ z 

-h --1-= 1 

R- ' R- ' R 1 - 
Jv e e p 


(7.2) 


The dynamical flattening, denoted by / is defined 
as 


Re Rp 
Re 


(7.3) 


Because R e > R p , the flattening / is always pos¬ 
itive. 

The giant planets are in practise close to hy¬ 
drostatic equilibrium, and their shape is deter¬ 
mined by the rotation. The rotation period of Sat¬ 
urn is only 10.5 h, and its dynamical flattening is 
1/10 which is easily visible. 

Asteroids and other minor bodies are so small 
that they are not flattened by rotation. However, 
there is an upper limit for a rotation rate of an 
asteroid before it breaks apart due to centrifugal 
forces. If we assume that the body is held together 
only by gravity, we can approximate the maxi¬ 
mum rotation rate by setting the centrifugal force 
equal to the gravitational force: 


GMm inv 2 
R 2 ~ “r”’ 


(7.4) 


where m is a small test mass on the surface at a 
distance of R from the center of the body. Substi¬ 
tuting the rotation period P , 


2n R 
P = - 
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we get 

GM _ 4jt 2 R 

~W ~ P 2 ’ 

or 

I R 3 / 3 

P = 2^',/-= 2;r /- 

\ GM V AnGp 

If we substitute the density p with the mean den¬ 
sity of terrestrial rocks, i.e. 2700 kgm -3 , we get 
for the minimum rotation period P ^ 2 hours. 

The structure of the terrestrial planets 
(Fig. 7.10) can also be studied with seismic 
waves. The waves formed in an earthquake are 
reflected and refracted inside a planet like any 
other wave at the boundary of two different lay¬ 
ers. The waves are longitudinal or transversal 
(P and S waves , respectively). Both can propagate 
in solid materials such as rock. However, only the 
longitudinal wave can penetrate liquids. One can 
determine whether a part of the interior material 
is in the liquid state and where the boundaries 
of the layers are by studying the recordings of 
seismometers placed on the surface of a planet. 
Naturally the Earth is the best-known body, but 
quakes of the Moon, Venus, and Mars have also 
been observed. 

The interior temperatures of the planets are 
considerably larger than the surface temperatures. 
For example, the temperature in the Earth’s core 


/ 3jt 

' —. (7.5) 



Fig. 7.10 Internal structure and relative sizes of the ter¬ 
restrial planets. The percentage shows the volume of the 
core relative to the total volume of the planet. In the case 
of the Earth, the percentage includes both the outer and 
the inner core 


is about 4500-5000 K, and in the core of Jupiter 
about 30,000 K. 

A part of that heat is the remnant of the re¬ 
leased potential energy from the gravitational 
contraction during the formation of planets. De¬ 
cay of radioactive isotopes also releases heat. 
Soon after the formation of planets intense me¬ 
teorite bombardment was an important source of 
heat. Together with heat from short-lived radioac¬ 
tive isotopes this caused melting of terrestrial 
planets. The planets were differentiated: the orig¬ 
inally relatively homogeneous material became 
segregated into layers of different chemical com¬ 
position. The heaviest elements sank into centre 
thus forming a dense core. 

The terrestrial planets have an iron-nickel 
core. Mercury has the relatively largest core; 
Mars the smallest. The density of the core is 
around 10,000 kgm -3 . The Fe-Ni core is sur¬ 
rounded by a mantle, composed of silicates (com¬ 
pounds of silicon). The density of the outermost 
layers is about 3000 kgm -3 . The mean density 
of the terrestrial planets is 3500-5500 kgm -3 . 
A thin crust is the outermost layer. The crust 
and the upper mantle form a solid lithosphere, 
and the partly molten layer below that is the as- 
tenosphere. Mean densities of planets are 3500- 
5500 kgm -3 , but the density of the surface ma¬ 
terial is less than 300 kg m -3 . 

Rock is not a good heat conductor, but in ter¬ 
restrial planets conduction is the only important 
method to transfer from the interior to the sur¬ 
face. Heat transfer by convection depends on the 
viscosity material and the temperature gradient. 
In the mantle of the Earth slow vertical convec¬ 
tive flows occur below a few hundred kilome¬ 
tres. These flows drive e.g. the motion of the tec¬ 
tonic plates. Continental drift gives rise, for ex¬ 
ample, to mountain formation. The Earth is the 
only planet where plate tectonics is active today. 
On other terrestrial planets the process has either 
ceased long ago or has never occurred. 

The giant planets (Fig. 7.11) do not have 
a similar isolating outer layer, and convection 
can transfer heat all the way to the surface. 
Thus the giant planets radiate more energy than 
they receive from the Sun. Saturn radiates about 
2.8 times the heat it gets from the Sun, more than 
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Jupiter 



Fig. 7.11 Internal structure and relative sizes of the gi¬ 
ant planets. Differences in size and distance from the Sun 
cause differences in the chemical composition and internal 
structure. Due to smaller size, Uranus and Neptune do not 
have any layer of metallic hydrogen. The Earth is shown 
in scale 

any other planet. This heat is suspected to origi¬ 
nate from the separation of hydrogen and helium, 
when the heavier helium is gradually sinking to¬ 
ward the centre of the planet. In Jupiter the heat 
is mainly remnant heat from the time the planet 
was born. 

The mean densities of the giant planets are 
quite low; the density of Saturn, for example, is 
only 700 kgm -3 . (If Saturn were put in a gigan¬ 
tic bathtub, it would float on the water!) Most of 
the volume of a giant planet is a mixture of hy¬ 
drogen and helium. In the centre, there is possibly 
a silicate core, the mass of which is a few Earth 
masses. The core is surrounded by a layer of 
metallic hydrogen. Due to the extreme pressure, 
hydrogen is not in its normal molecular form tD, 
but dissociated into atoms. In this state, hydrogen 
is electrically conducting. The magnetic fields of 
the giant planets may originate in the layer of 
metallic hydrogen. 

Closer to the surface, the pressure is lower and 
hydrogen is in molecular form. The relative thick¬ 
ness of the layers of metallic and molecular hy¬ 
drogen vary from planet to planet. Uranus and 
Neptune may not have any layer of metallic hy¬ 
drogen because their internal pressure is too low 
for dissociation of the hydrogen. Instead, a layer 
of “ices” surround the core. This is a layer of 
a water-dominant mixture of water, methane and 


ammonia. Under the high pressure and tempera¬ 
ture the mixture is partly dissolved into its com¬ 
ponents and it behaves more like a molten salt and 
it is also electrically conductive like the metallic 
hydrogen. 

On top of everything is a gaseous atmosphere, 
only a few hundred kilometres thick. The clouds 
at the top of the atmosphere form the visible “sur¬ 
face” of the giant planets. 

Planetary surfaces are modified by several 
geological processes. These include continental 
drift, volcanism, meteorite impacts and climate. 
The Earth is an example of a body whose surface 
has been renewed many times during past aeons. 
The age of the surface depends on the processes 
and thus implies the geological evolutionary his¬ 
tory of the planet. 

Volcanism is a minor factor on the Earth (at 
least now), but the surface of the Jovian moon Io 
is changing rapidly due to violent volcanic erup¬ 
tions (Fig. 7.12). Volcanism on Io is caused by 
frictional heating by tides. Volcanoes have been 
observed also on Mars and Venus, but not on the 
Moon. 

Lunar craters are meteorite impact craters, 
common on almost every body with a solid sur¬ 
face. Meteorites are bombarding the planets con¬ 
tinuously, but the rate has been diminishing since 
the beginnings of the solar system. The number 
of impact craters reflects the age of the surface 
(Figs. 7.13 and 7.14). 

The Jovian moon Callisto is an example of 
a body with an ancient surface which is not 
fully inactive. Lack of small craters indicates 
some resurfacing process filling and degrading 
the minor surface features. The Earth is an ex¬ 
ample of a body, whose atmosphere both pro¬ 
tects the surface and destroys the traces of im¬ 
pacts. All smaller meteorites are burned to ashes 
in the atmosphere (one need only note the num¬ 
ber of shooting stars), and some larger bodies are 
bounced back to outer space. The traces on the 
surface are destroyed very quickly by erosion in 
less than a few million years. Venus is an even 
more extreme case where all small craters are 
missing due to a thick protective atmosphere. 

Climate has the greatest influence on the Earth 
and Venus. Both planets have a thick atmosphere. 
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Fig. 7.12 An example of 
resurfacing. Two volcanic 
plumes on Jupiter’s moon 
Io observed by Galileo 
spacecraft in 1997. One 
plume was captured on the 
bright limb or edge of the 
moon (inset at upper 
right), erupting over a 
caldera named Pillan 
Patera. The plume is 140 
kilometers high. The 
second plume, seen near 
the terminator, is called 
Prometheus. The shadow 
of the 75 km high airborne 
plume can be seen 
extending to the right of the 
eruption vent. (NASA/JPL) 



On Mars, powerful dust storms deform the land¬ 
scape, too, often covering the planet with yellow¬ 
ish dust clouds. 


7.7 Atmospheres and 
Magnetospheres 

All major planets have an atmosphere; the atmo¬ 
sphere of Mercury, however, is extremely thin. 
The giant planets are surrounded by a very thick 
gaseous layer that can be regarded as an at¬ 
mosphere. Saturn’s moon Titan has a relatively 
thick atmosphere consisting of methane. Also the 
dwarf planet Pluto has a thin atmosphere consist¬ 
ing mainly of methane. 

The composition, thickness, density and struc¬ 
ture of the atmosphere vary from planet to planet, 
but some common features can be found 
(Figs. 7.15 and 7.16). Let us first study the de¬ 
pendence of the temperature T, pressure P, and 
density p on the height h. Let us consider a cylin¬ 
der with a length dh. The change in the pres¬ 
sure dP from the height h to h + dh is pro¬ 
portional to the mass of the gas in the cylin¬ 
der: 

(7.6) 



Fig. 7.13 Ages of the surfaces of Mercury, the Earth, the 
Moon and Mars. The curve represents the fraction of the 
surface which existed at a certain time. Most of the surface 
of the Moon, Mercury and Mars are more than 3500 mil¬ 
lion years old, whereas the surface of the Earth is mostly 
younger than 200 million years 

where g is the acceleration of gravity. Equation 
(7.6) is the equation of hydrostatic equilibrium. 
(It is discussed in detail in Chap. 11 .) 

As a first approximation, we may assume that 
g does not depend on height. In the case of the 
Earth, the error is only about 3 % if g is con¬ 
sidered constant from the surface to a height of 
100 km. 


dP = — gp dh, 
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Fig. 7.14 The number of meteorite impact craters is an 
indicator of the age of the surface and the shapes of the 
craters give information on the strength of the material. 
The upper row shows Mercury (left) and the Moon, and the 
second row, the Jovian moons Europa (left), Ganymede 
(centre) and Callisto. The pictures of the Jovian moons 
were taken by the Galileo orbiter with a resolution of 
150 metres/pixel. Europa has only a few craters, there 
are areas of different ages on the surface Ganymede and 
the surface of Callisto is the oldest. Note the grooves and 


ridges that indicate different geological processes. IN the 
bottom there are two volcanic plumes on Jupiter’s moon 
Io observed by Galileo spacecraft in 1997. One plume was 
captured on the bright limb or edge of the moon (inset at 
upper right), erupting over a caldera named Pillan Patera. 
The plume is 140 kilometers high. The second plume, seen 
near the terminator, is called Prometheus. The shadow of 
the 75 km high airborne plume can be seen extending to 
the right of the eruption vent. (NASA/JPL and DLR) 


The equation of state of the ideal gas 

PV — NkT (7.7) 


gives the expression for the pressure P 
P= P —, 


where N is the number of atoms or molecules, 
k is the Boltzmann constant, // is the mass of one 


atom or molecule and 


P = 


ptN 

~V~' 


By using the equation of hydrostatic equilibrium 
(7.6) and the equation of state (7.8), we obtain 
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Fig. 7.15 Relative 
abundances of the most 
abundant gases in the 
atmospheres of Venus, 
Earth, and Mars. The 
number at the bottom of 
each circle denotes the 
surface pressure in atms 
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Fig. 7.16 (a) Temperature as a function of height in the 
atmospheres of Venus, Earth, and Mars, (b) Temperature 
profiles of the atmospheres of Jupiter and Saturn. The 
zero height is chosen to be the point where the pressure 
is 100 mbar. Numbers along the curves are pressures in 
millibars 


Integration yields P as a function of height: 
/•=/•„ exp(-/ o ffd*) 

= /„'")■ < 79) 

The variable H, which has the dimension of 
length, is called the scale height : 

kT 

H =—. (7.10) 

Pg 


The scale height defines the height at which the 
pressure has decreased by a factor e. H is a func¬ 
tion of height, but here we may assume that it is 
constant. With this approximation, we obtain 


or, using (7.8), 


h 

77 



— e~ h / H . (7.11) 

PqTo 

The scale height is an important parameter in 
many formulas describing the structure of the at¬ 
mosphere (Table 7.1). For example, if the change 
of the pressure or the density is known as a func¬ 
tion of height, the mean molecular weight of the 
atmosphere can be computed. The scale height of 
the Jovian atmosphere was determined in 1952 
when Jupiter occulted a star. With these observa¬ 
tions, the scale height was calculated to be 8 km, 
and the mean molecular weight 3-5 amu (atomic 
mass unit, 1/12 of the mass of 12 C). Thus the 
main components are hydrogen and helium, a re¬ 
sult later confirmed by spacecraft data. 

In terrestrial observations, infrared data are 
limited by water vapour and carbon dioxide. The 
scale height of CO 2 is 5 km, which means that 
the partial pressure is already halved at a height 
of 3.5 km. Thus infrared observations can be 
made on top of high mountains (like Mauna Kea 
in Hawaii). The scale height of water vapour 
is 13 km, but the relative humidity and hence 
the actual water content is very site- and time- 
dependent. 

The scale height and the temperature of the 
atmosphere define the permanence of the atmo¬ 
sphere. If the speed of a molecule is greater than 
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Table 7.1 Scale heights of some gases in the atmospheres of Venus, Earth, and Mars 


Gas 

Molecular weight [amu] 

Earth H [km] 

Venus H [km] 

Mars H [km] 

h 2 

2 

120 

360 

290 

o 2 

32 

7 

23 

18 

h 2 o 

18 

13 

40 

32 

co 2 

44 

5 

16 

13 

n 2 

28 

8 

26 

20 

Temperature [K] 


275 

750 

260 

Acceleration of gravity [m/s 2 ] 


9.81 

8.61 

3.77 


the escape velocity, the molecule will escape into 
space. The whole atmosphere could disappear in 
a relatively short time. 

According to the kinetic gas theory, the mean 
velocity v of a molecule depends both on the ki¬ 
netic temperature of the gas and the mass m of 
the molecule: 


v = 



(7.12) 


If the mass of a planet is M and its radius R, the 
escape velocity is 


I2GM 

= —. (7.13) 

Even if the mean velocity is smaller than the 
escape velocity, the atmosphere can evaporate 
into space if there is enough time, since some 
molecules will always have velocities exceed¬ 
ing v e . Assuming a velocity distribution, one can 
calculate the probability for v > v e Hence it is 
possible to estimate what fraction of the atmo¬ 
sphere will disappear in, say, 10 9 years. As a rule 
of thumb, it can be said that at least half of the 
atmosphere will remain over 1000 million years 
if the mean velocity v < 0.2u e . 

Giant planets move far from the Sun; thus 
the surface temperature is low. Also, the grav¬ 
itation is strong. Thus it is understandable that 
e.g. Jupiter has been able to retain more hydro¬ 
gen than the Earth. 

The probability that a molecule close to the 
surface will escape is insignificantly small. The 
free mean path of a molecule is very small when 
the gas density is high (Fig. 7.17). Thus the escap¬ 
ing molecule is most probably leaving from the 


Fig. 7.17 Close to the surface, the mean free path of 
a molecule is smaller than higher in the atmosphere where 
the gas density is smaller. The escaping molecules origi¬ 
nate close to the critical layer 

uppermost layers. The critical layer is defined as 
a height at which a molecule, moving upward, has 
a probability 1 /e of hitting another molecule. The 
part of the atmosphere above the critical layer is 
called the exosphere. The exosphere of the Earth 
begins at a height of 500 km, where the kinetic 
temperature of the gas is 1500-2000 K and the 
pressure is lower than in the best terrestrial vacu¬ 
ums. 

The magnetosphere is the “outer boundary” of 
a planet. Size and shape depend on the strength 
of the magnetic field of the planet and on the 
solar wind. The solar wind is a flux of charged 
particles, mostly electrons and protons, outflow- 
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Fig. 7.18 Structure of the magnetosphere of the Earth. (A. Nurmi/Tiede 2000) 


ing from the Sun. The speed of the wind at the 
distance of the Earth is about 500 km/s and the 
density 5-10 particles/cm 3 but both values can 
change considerably depending on the solar ac¬ 
tivity. 

On the solar side there is a bow shock 
(Fig. 7.18), typically at a distance of a few tens 
of planetary radii (Table 7.2). At the bow shock, 
particles of the solar wind first hit the magne¬ 
tosphere. The magnetosphere is limited by the 
magnetopause , flattened on the solar side and ex¬ 
tended to a long tail on the opposite side. Charged 
particles inside the magnetopause are captured by 
the magnetic field and some particles are accel¬ 
erated to great velocities (Fig. 7.19). If the ve¬ 
locities are interpreted according to the kinetic 
gas theory, these velocities even correspond to 
millions of kelvins. However, the density, and 
thus the total energy, is very small. The “hottest” 


places found are around Jupiter and Saturn, where 
the particle velocities correspond even tempera¬ 
tures of 400 million kelvins. 

The region of space containing trapped char¬ 
ged particles, the radiation belts around the Earth, 
are named van Allen’s belts. These radiation 
zones were discovered by the first US satellite, 
Explorer 1, in 1958. 

The number of charged particles increases af¬ 
ter strong solar bursts. Some of these particles 
“leak” to the atmosphere, resulting in auroras. 
Similar effects have also been detected in Jupiter, 
Saturn and Uranus. 

The solar magnetic field arises from the turbu¬ 
lent motions of the electrically conductive matter. 
The energy driving the convection in the layer is 
coming from the nuclear fusion in the core. This, 
however, cannot explain planetary magnetism. 
Neither can the remanent primordial magnetic 
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Table 7.2 Planetary magnetic fields 



Dipole moment (Earth = 1) 

Field strength (gauss ) 3 

Polarity b 

Angle 3 

Magneto-pause d 

Mercury 

0.0007 

0.003 


14° 

1.5 

Venus 

<0.0004 

<0.00003 

- 

- 

- 

Earth 

1.0 

0.305 

u 

11 ° 

10 

Mars 

< 0.0002 

<0.0003 

- 

- 

- 

Jupiter 

20 , 000 . 

4.28 

11 

10 ° 

80 

Saturn 

600. 

0.22 

11 

< 1 ° 

20 

Uranus 

50. 

0.23 

11 

o 

On 

in 

20 

Neptune 

25. 

0.14 

11 

47 ° 

25 


3 At equator (1 gauss equals 10 4 T) 
same as the Earth, 4)- opposite 
c Angle between magnetic and rotational axes 

d Average magnetopause distance in the direction of the Sun in planetary radii 



Fig. 7.19 A glow of hot plasma trapped inside the Earth’s 
magnetosphere. The picture was taken by NASA's Im¬ 
ager for Magnetopause to Aurora Global Exploration (IM¬ 
AGE) spacecraft on August 11, 2000 at 18:00 UT. The 
Sun is outside the picture area toward the top right comer. 
(NASA and the IMAGE science team) 


field explain it because the internal temperature 
of planets is well above the Curie point (about 
850 K for magnetite). If the temperature is above 
the Curie point, ferromagnetic materials will lose 
their remanent magnetism. 


The planetary dynamo generating the mag¬ 
netic held requires that the planet is rotating and 
has a convective layer of electrically conductive 
material. The temperature gradient across this 
layer must also be high enough to maintain con¬ 
vection. Terrestrial planets have a liquid Fe-Ni 
core, or a liquid layer in the core, Jupiter and Sat¬ 
urn have a layer of liquid metallic hydrogen and 
Uranus and Neptune have a mixture of water, am¬ 
monia and methane. 

The strength of the magnetic held varies a lot 
from planet to planet. It can be characterised 
by the dipole magnetic moment. The magnetic 
moment of Jupiter is about 100 million times 
that of Mercury. The magnetic moment of the 
Earth is about 7.9 x 10 25 gauss cm 3 that can 
be compared to the typical strong electromag¬ 
netic helds achieved in the laboratories, about 
100,000 gauss cm 3 . Inducing such a strong held 
requires currents that are of the order of 10 9 Am¬ 
peres. When divided by the cube of planetary 
radii, one gets an estimate of the held strength 
on the equator. 

The alignment of the magnetic held with re¬ 
spect to the rotation axis of a planet differs from 
planet to planet (Fig. 7.20). Saturn’s magnetic 
held is close to the ideal case where rotational 
axis and magnetic axis coincide. Also the Earth 
and Jupiter show reasonably good point dipole 
held with a tilt of about 10°. However, helds of 
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Fig. 7.20 Planetary magnetic fields 


Uranus and Neptune are both offset from the cen¬ 
tre of the planet and tilted by about 50° from the 
rotation axis. This may indicate a different mech¬ 
anism for the dynamo. 

The magnetic fields of Mercury and the Earth 
have an opposite polarity than the fields of 
other planets. It is known that the polarity of 
the Earth’s magnetic field has reversed several 
times over geologic time scales, previously about 
750,000 years ago. There are some indications 
that the reversal of the polarity is beginning now 
because the field strength is declining about one 
percent per decade, magnetic poles are moving 
more rapidly and the field asymmetry is increas¬ 
ing. The whole process will take several thousand 
years during which the Earth’s surface is more 
open to the cosmic rays. 

The Galileo mission also revealed that the Jo¬ 
vian moon Ganymede has a magnetic field. The 
field is weak and too small to have a magneto¬ 
tail or trapped particles around the moon. Cal- 
listo, which is of the same size, does not show 
any magnetosphere. Neither does our Moon have 
any global magnetic field. 


7.8 Albedos 

The planets and all other bodies of the solar sys¬ 
tem only reflect the radiation of the Sun (we may 
neglect here the thermal and radio wave radia¬ 
tion and concentrate mainly on the visual wave¬ 
lengths). The brightness of a body depends on its 
distance from the Sun and the Earth, and on the 
albedo of its surface. The term albedo defines the 
ability of a body to reflect light. 



Fig. 7.21 Symbols used in the photometric formulas. The 
angle a is the phase angle 


If the luminosity of the Sun is Lq, the flux 
density at the distance r is (Fig. 7.21) 


F = 


Lq 
4 nr 2 


(7.14) 


If the radius of the planet is R, the area of its cross 
section is itR 2 , and the total flux incident on the 
surface of the planet is 


2 Lq LqR 
47tr 2 4 r 2 


(7.15) 


Only a part of the incident flux is reflected back. 
The other part is absorbed and converted into 
heat which is then emitted as a thermal emission 
from the planet. The Bond albedo A (or spheri¬ 
cal albedo) is defined as the ratio of the emergent 
flux to the incident flux (0 < A < 1). The flux re¬ 
flected by the planet is thus 


= AL in = 


ALqR 2 
4 r 2 


(7.16) 


The planet is observed at a distance A. If radi¬ 
ation is reflected isotropically, the observed flux 
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density should be 

F =ttr (7J7 > 

In reality, however, radiation is reflected aniso- 
tropically. If we assume that the reflecting object 
is a homogeneous sphere, the distribution of the 
reflected radiation depends on the phase angle a 
only. Thus we can express the flux density ob¬ 
served at a distance A as 

F = C0(a) ^- (7-18) 

The function 0 giving the phase angle depen¬ 
dence is called the phase function. It is nor¬ 
malised so that 0 (a = 0°) = 1. 

Since all the radiation reflected from the planet 
is found somewhere on the surface of the sphere, 
we must have 

f s C0(a) 4^2 d5 = L ° ut (7 ' 19) 


Remembering that L out = AL m , (7.18) can be 
written in the form 


F 


CA 

— 0(a) 
4tt 


1 

A2 Ll! 


(7.25) 


The first factor is intrinsic for each object, the sec¬ 
ond gives the phase angle dependence, the third 
the distance dependence and the fourth, the inci¬ 
dent radiation power. The first factor is often de¬ 
noted by 



(7.26) 


When we substitute here the expression of C 
(7.24), and solve for the Bond albedo, we get 


4jt r 4 

A — = n r — = jtTq = pq. (7.27) 

Here p — tcT is called the geometric albedo 
and q is the previously introduced phase integral. 
These quantities are related by 


or 

4^2//(«)“=>. (7-20) 

where the integration is extended over the surface 
of the sphere of radius A. The surface element 
of such a sphere is dS = A 2 da sin a df, and we 
have 


/, 


0(a) dS = 


0 (a ) sin a da d<p 


(* 7T (* LIZ 

= A 2 f j 
J a = 0 J (f) = 0 

= A 2 27 r / <J(a)sinada. 

Jo 


(7.21) 


The normalisation constant C is 
4?rA 2 2 

C = 


f s 0(a) dS 0 (a) sin a da 

The quantity 


. (7.22) 


q — 2 f 0 (a) sin a da (7.23) 

J o 

is the phase integral. In terms of the phase inte¬ 
gral the normalisation constant is 


A — pq. (7.28) 


The geometric albedo seems to have appeared 
as an arbitrary factor with no obvious physical in¬ 
terpretation. Well now try to explain this quan¬ 
tity using a Lambertian surface. A Lambertian 
surface is defined as an absolutely white, diffuse 
surface which reflects all radiation, i.e. its Bond 
albedo is A = 1. Moreover, its surface bright¬ 
ness is the same for all viewing directions, which 
means that the phase function is 


0(a) = 


cos a, 

0, 


if 0 < a < 7r/2, 
otherwise. 


(7.29) 


In reality, no such surface exists but there are 
some materials which behave almost like a Lam¬ 
bertian surface. A wall with a mat white finish is 
a good approximation; although it doesn’t reflect 
all incident light, the distribution of the reflected 
light is about right, and its brightness looks the 
same from all directions. 

For a Lambertian surface the constant C is 



C = 


2 

Jq 0(a) sin a da 


(7.24) 
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2 

Jq^ cos a sin a da 
2 



(7.30) 


Thus the geometric albedo of a Lambertian sur¬ 
face is 


CA 4x1 

p = 7tT = —— — —— = 1. 

4 4 


(7.31) 


At the phase angle zero 0(a = 0°) = 1 and 
the reflected flux density is 


F = 


CA 1 
47T A 2 


If we replace the object with a Lambertian surface 
of the same size, we get 


Fl = 


4 1 
47r A 2 1 


The ratio of these flux densities is 
F CA 

- = —=7rr = p. (7.32) 

Now we have found a physical interpretation 
for p: the geometric albedo is the ratio of the 
flux densities at phase angle a = 0° reflected by 
a planet and a Lambertian surface of the same 
cross section. 

The geometric albedo depends on the re¬ 
flectance of the surface but also on the phase 
function 0. Many rough surfaces reflect most of 
the incident radiation directly backward. In such 
a case the geometric albedo p is greater than in 
the case of an isotropically reflecting surface. On 
some surfaces p > 1; the most extreme case is a 
mirror, for which the specular reflection, p — oo. 
The geometric albedo of solar system bodies vary 
between 0.03-1. The geometric albedo of the 
Moon is p — 0.12 and the greatest value, p — 1.0, 
has been measured for the Saturnian moon Ence- 
ladus. 

It turns out that p can be derived from the ob¬ 
servations, but the Bond albedo A can be deter¬ 
mined only if the phase integral q is also known. 
That will be discussed in the next section. 


7.9 Photometry, Polarimetry and 
Spectroscopy 


Having defined the phase function and albedos 
we are ready to derive a formula for planetary 
magnitudes. The flux density of the reflected light 
is 


F 


CA 

- 0(a) 

An 


1 

A 2 ^ 


We now substitute the incident flux 


_L Q R 2 
m 4 r 2 

and the constant factor expressed in terms of the 
geometric albedo 


CA _ _ p 

An n 

Thus we get 


F 


-0(a) 

TV 


1 LqR 2 
A 2 4r 2 


(7.33) 


The observed solar flux density at a distance of 
a = I au from the Sun is 


Fq = 


l q 
4n a 2 


The ratio of these is 


F p0(a)R 2 a 2 
~Fq ~ ~A 2 r 2 


(7.34) 


(7.35) 


If the apparent solar magnitude at a distance of 
1 au is m© and the apparent magnitude of the 
planet m we have 


m — niQ = —2.5 lg 


F 

~Fq 


= —2.51g 


p0(a)R 2 a 2 
A 2 r 2 


— —2-51g 


pR 2 a 4 
a 2 A 2 r 2 


0(a) 


R 2 a 4 

— —2.5 lg p-^- — 2.5 lg —t~2 


- 2.5 lg 0(a) 
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R 2 A r 

= -2.5lgp— + 51g — 

a z a*- 

- 2.5lg0(a). (7.36) 

If we denote 

R 2 

V(1,0) = m G — 2.5 lg p — , (7.37) 

a*~ 

then the magnitude of a planet can be expressed 
as 

r A 

m — V(1,0) T 5 lg —z—2.5 lg 0(a). (7.38) 

a L 

The first term V (1,0) depends only on the size 
of the planet and its reflection properties. So it is 
a quantity intrinsic to the planet, and it is called 
the absolute magnitude (not to be confused with 
the absolute magnitude in stellar astronomy!). 
The second term contains the distance depen¬ 
dence and the third one the dependence on the 
phase angle. 

If the phase angle is zero, and we set r = A = 
a, (7.38) becomes simply m — V(1,0). The ab¬ 
solute magnitude can be interpreted as the mag¬ 
nitude of a body if it is at a distance of 1 au from 
the Earth and the Sun at a phase angle a = 0°. 
As will be immediately noticed, this is physically 
impossible because the observer would be in the 
very centre of the Sun. Thus V (1, 0) can never be 
observed. 

By using (7.37) and (7.38) at a — 0°, the ge¬ 
ometric albedo can be solved for in terms values 
all obtainable from observations. 

p= (—) (7.39) 

\aR ) 

where m o = m(a = 0°). As can easily be seen, 
p can be greater than unity but in the real world, it 
is normally well below that. Typical values for p 
are in the range 0.1-0.5. 

The last term containing the phase angle de¬ 
pendence in (7.38) is the most problematic one. 
For many objects the phase function is not known 
very well. This means that from the observations, 
one can calculate only 

V(l, a) = V(l, 0) — 2.5 lg 0(a), (7.40) 


which is the absolute magnitude at phase an¬ 
gle a. V (1, a), plotted as a function of the phase 
angle, is called the phase curve (Fig. 7.22). The 
phase curve extrapolated to a = 0° gives V (1,0). 
The shape of the phase curve is very different for 
objects with or without an atmosphere. 

The Bond albedo can be determined only if 
the phase function 0 is known. Superior planets 
(and other bodies orbiting outside the orbit of the 
Earth) can be observed only in a limited phase an¬ 
gle range, and therefore 0 is poorly known, ex¬ 
cept for those bodies that have been observed by 
spacecraft. The situation is somewhat better for 
the inferior planets. Especially in popular texts 
the Bond albedo is given instead of p (naturally 
without mentioning the exact names!). A good 
excuse for this is the obvious physical meaning of 
the former, and also the fact that the Bond albedo 
is normalised to [0, 1]. 

Opposition Effect If an object has an atmo¬ 
sphere it reflects light more or less isotropically 
to all directions. The flux density of the reflected 
light is then proportional to the area of the visible 
illuminated surface (actually to the projection of 
this area on a plane perpendicular to the line of 
sight). Atmosphereless bodies reflect light more 
strongly to the direction of the incident light. 
Hence the brightness increases rapidly when the 
phase angle approaches zero. When the phase is 
larger than about 10°, the changes are smaller. 
This rapid brightening close to the opposition is 
called the opposition effect. An atmosphere de¬ 
stroys the opposition effect. 

The full explanation is still in dispute. A qual¬ 
itative (but only partial) explanation is that close 
to the opposition, no shadows are visible. When 
the phase angle increases, the shadows become 
visible and the brightness drops. The main rea¬ 
son, however, is the coherent backscatter due to 
the wave properties of the light. 

Magnitudes of Asteroids The shape of the 
phase curve depends on the geometric albedo. It 
is possible to estimate the geometric albedo if the 
phase curve is known. This requires at least a few 
observations at different phase angles. Most crit¬ 
ical is the range 0°-10°. A known phase curve 
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Fig. 7.22 The phase curves and polarisation of differ¬ 
ent types of asteroids. The asteroid characteristics are dis¬ 
cussed in more detail in Sect. 8.11. (From Muinonen et 

can be used to determine the diameter of the 
body, e.g. the size of an asteroid. Apparent di¬ 
ameters of asteroids are so small that for ground 
based observations one has to use indirect meth¬ 
ods, like polarimetric or radiometric (thermal ra¬ 
diation) observations (Fig. 7.22). Beginning from 
the 1990’s, imaging made during spacecraft fly¬ 
bys and with the Hubble Space Telescope have 
given also direct measures of the diameter and 
shape of asteroids. 

When the phase angle is greater than a few de¬ 
grees, the magnitude of an asteroid depends al¬ 
most linearly on the phase angle. Earlier this lin¬ 
ear part was extrapolated to a — 0° to estimate 
the opposition magnitude of an asteroid. Due to 
the opposition effect the actual opposition mag¬ 
nitude can be considerably brighter. 

In 1985 the IAU adopted the HG system for 
magnitudes of atmosphereless bodies. Formally, 


al ., Asteroid photometric and polarimetric phase effects, 
in Bottke, Binzel, Cellino, Paolizhi (Eds.) Asteroids III, 
University of Arizona Press, Tucson) 

it was semi-empirical, although it was based on 
photometric theories by Lumme and Bowell. In 
the 2012 meeting this was replaced by a new 
HG 1 G 2 system. Although the older HG system 
was useful in many cases, it was not satisfac¬ 
tory if the opposition effect was very small or re¬ 
stricted to very small phase angles. 

In the new system the magnitude at phase an¬ 
gle a is 

V{\ ,a) 

= -2.51g[ai0i(a) + a 2 0 2 (a) + a3@3 (a)] 
= #-2.51g[Gi<2>i(a) + G 2 <Z>2(a) 

+ (1-G 1 -G 2 )4> 3 (a)]. (7-41) 

where the values of the basis functions <t> 1 , &2 
and <t >3 are found by spline interpolations from 
the following tables: 
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an 

01 

0.0 

1.0 

7.5 

0.75 

30.0 

0.33486016 

60.0 

0.13410560 

90.0 

0.05110476 

120.0 

0.02146569 

150.0 

0.00363970 

0 |( 7 . 5 °) = — 1.90986 

<p;( 150 °) = - 0.09133 


and hence H is just the absolute magnitude in op¬ 
position. The constants G\ and G 2 describe the 
shape of the phase curve. 

Asteroid data has earlier been published in the 
yearbook Efemeridy malyh planet. Currently the 
best source is the web pages of the Minor Planet 
Center: http://www.cfa.harvard.edu/iau/services/ 
WebCS Access .html . 


a[°] 

( p 2 

0.0 

1.0 

7.5 

0.925 

30.0 

0.62884169 

60.0 

0.31755495 

90.0 

0.12716367 

120.0 

0.02237390 

150.0 

0.00016506 

0 '( 7 . 5 °) = - 0.57330 

0 '( 15 O°) =- 8.657 x 10 -8 


Polarimetric Observations The light reflected 
by the bodies of the solar system is usually po¬ 
larised, at least to some degree. The amount of 
polarisation depends on the reflecting material 
and also on the geometry: polarisation is a func¬ 
tion of the phase angle. The degree of polarisa¬ 
tion P is defined as 


P= F± ~ F \ 
F± + F„ 


(7.44) 


a[°] 

0 3 

0.0 

1.0 

0.3 

0.83381185 

1.0 

0.57735424 

2.0 

0.42144772 

4.0 

0.23174230 

8.0 

0.10348178 

12.0 

0.06173347 

20.0 

0.01610701 

30.0 

0.0 

0 '(O°) =- 0.10630 

0 j( 3 O°) = O 


When the phase angle is zero all the functions 
have the value 1. If the phase angle is a < 7.5°, 
0 1 ans 02 are linear functions: 0 \ (a) — 1 — 
a/30°, 0 2 (o!) = 1 - a/100°. 

Fitting an expression in terms of the basis 
functions to the observed phase curve one gets 
the coefficients a,-, and then further 

H — -2.5 lg(ai + «2 + 03 ), 

G\ — a\/(a\ + 02 + < 23 ), (7.42) 

G 2 = 02 / (a\ +a 2 + 03 ), 

When the phase angle is zero, we have 

V{\, 0) = H - 2.51g[Gi + G 2 + 1 - Gi - G 2 ] 
= H, (7.43) 


where F± is the flux density of radiation, per¬ 
pendicular to a fixed plane, called the scattering 
plane, and Fj| is the flux density parallel to the 
plane. In solar system studies, polarisation is usu¬ 
ally referred to the plane defined by the Earth, the 
Sun, and the object. According to (7.44), P can 
be positive or negative; thus the terms “positive” 
and “negative” polarisation are used. 

The degree of polarisation as a function of the 
phase angle depends on the surface structure and 
the atmosphere. The degree of polarisation of the 
light reflected by the surface of an atmosphereless 
body is positive when the phase angle is greater 
than about 20°. Closer to opposition, polarisa¬ 
tion is negative. A dependence between the polar¬ 
isation and geometric albedo has been observed. 
This gives an independent method for determin¬ 
ing the albedo and the size. 

When light is reflected from an atmosphere, 
the degree of polarisation as a function of the 
phase angle is more complicated. For some phase 
angles P can be highly negative. Using the theory 
of radiative transfer, one can compute how the at¬ 
mosphere affects light and its polarisation. Com¬ 
paring these results with observations one can ob¬ 
tain information about the contents of the atmo¬ 
sphere. For example, the composition of Venus’ 
atmosphere could be studied by polarisation stud¬ 
ies before any probes were sent to the planet. 
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Fig. 7.23 Spectra of the Moon and the giant planets. Strong absorption bands can be seen in the spectra of Uranus and 
Neptune. (Lowell Observatory Bulletin 42 (1909)) 


Planetary Spectroscopy The photometric and 
polarimetric observations discussed above were 
monochromatic. However, the studies of the at¬ 
mosphere of Venus also used spectral informa¬ 
tion. Broadband UBV photometry or polarime- 
try is the simplest example of spectrophotometry 
(spectropolarimetry). The term spectrophotome¬ 
try usually means observations made with sev¬ 
eral narrowband filters. Naturally, solar system 
objects are also observed by means of “classical” 
spectroscopy. 

Spectrophotometry and polarimetry give in¬ 
formation at discrete wavelengths only. In prac¬ 
tise, the number of points of the spectrum (or 
the number of filters available) is often limited 
to 20-30. This means that no details can be seen 
in the spectra. On the other hand, in ordinary 
spectroscopy, the limiting magnitude is smaller, 
although the situation is rapidly improving with 
the new generation detectors, such as the CCD 
camera. 

The spectrum observed is the spectrum of the 
Sun. Generally, the planetary contribution is rel¬ 


atively small, and these differences can be seen 
when the solar spectrum is subtracted. The Ura¬ 
nian spectrum is a typical example (Fig. 7.23). 
There are strong absorption bands in the near- 
infrared. Laboratory measurements have shown 
that these are due to methane. A portion of the 
red light is also absorbed, causing the greenish 
colour of the planet. The general techniques of 
spectral observations are discussed in the context 
of stellar spectroscopy in Chap. 9. 


7.10 Thermal Radiation of the 
Planets 

Thermal radiation of the solar system bodies de¬ 
pends on the albedo and the distance from the 
Sun, i.e. on the amount of absorbed radiation. In¬ 
ternal heat is important in Jupiter and Saturn, but 
we may neglect it at this point. 

By using the Stefan-Boltzmann law, the flux 
on the surface of the Sun can be expressed as 

L = 4n RqCtTq. 
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If the Bond albedo of the body is A, the fraction 
of the radiation absorbed by the planet is (1 — 
A). This is later emitted as heat. If the body is at 
a distance r from the Sun, the absorbed flux is 


L 


abs — 


R 2 aT*jtR 2 

-(1 — A). 

r L 


(7.45) 


There are good reasons to assume that the body 
is in thermal equilibrium , i.e. the emitted and 
the absorbed fluxes are equal. If not, the body 
will warm up or cool down until equilibrium is 
reached. 

Let us first assume that the body is rotating 
slowly. The dark side has had time to cool down, 
and the thermal radiation is emitted mainly from 
one hemisphere. The flux emitted is 


Lem — 2 jtR 2 (jT 4 , (7.46) 


where T is the temperature of the body and 2n R 2 
is the area of one hemisphere. In thermal equilib¬ 
rium, (7.48) and (7.49) are equal: 


4 r 0 

r 2 


(1 - A) = 2T 4 , 


whence 


T = 




(7.47) 


A body rotating quickly emits an approximately 
equal flux from all parts of its surface. The emit¬ 
ted flux is then 


Table 7.3 Theoretical and observed temperatures of 
some planets 



Albedo 

Distance 
from 
the Sun 
[AU] 

Theoretical 

temperature 

[K] 

Observed 

maximum 

temperature 

[K] 

(7.50) 

(7.51) 

Mercury 

0.06 

0.39 

525 

440 

700 

Venus 

0.76 

0.72 

270 

230 

750 

Earth 

0.36 

1.00 

290 

250 

310 

Mars 

0.16 

1.52 

260 

215 

290 

Jupiter 

0.73 

5.20 

110 

90 

130 


effect: radiation is allowed to enter, but not to exit. 
The same effect is at work in the Earth’s atmo¬ 
sphere. Without the greenhouse effect, the mean 
temperature could be well below the freezing 
point and the whole Earth would be ice-covered. 
Particularly strong the effect is on Venus, where 
the surface temperature is hundreds of degrees 
higher than the theoretical value. 

According to the Wien displacement law 
(5.22) k max = b/T the radiation maximum of 
a body at 200 K is at X — 14 pm, deep in in¬ 
frared. When the thermal radiation in the infrared 
or radio range is measured the temperature can be 
found, and further the Bond albedo can be calcu¬ 
lated from (7.47) or (7.48). If also the phase func¬ 
tion is known the geometric albedo and hence the 
diameter can be evaluated. 
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Lem — 4 tz R 2 crT‘ i 
and the temperature 



The theoretical temperatures obtained above 
are not valid for most of the major planets. The 
main “culprits” responsible here are the atmo¬ 
sphere and the internal heat. Measured and the¬ 
oretical temperatures of some major planets are 
compared in Table 7.3. Venus is an extreme ex¬ 
ample of the disagreement between theoretical 
and actual figures. The reason is the greenhouse 


Cosmogony is a branch of astronomy which stud¬ 
ies the origin of the solar system. The first steps 
of the planetary formation processes are closely 
connected to star formation. 

Although the properties and details of the bod¬ 
ies of our solar system (see next chapter) may 
look wildly different there are some distinct fea¬ 
tures which have to be explained by any serious 
cosmogonical theory. These include: 

- planetary orbits are almost coplanar and also 
parallel to the solar equator; 

- orbits are almost circular; 

- planets orbit the Sun counterclockwise, which 
is also the direction of solar rotation; 
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Table 7.4 True distances of the planets from the Sun and 
distances according to the Titius-Bode law (7.49) 


Planet 

n 

Calculated 

distance 

[AU] 

True 

distance 

[AU] 

Mercury 

—oo 

0.4 

0.4 

Venus 

0 

0.7 

0.7 

Earth 

i 

1.0 

1.0 

Mars 

2 

1.6 

1.5 

Ceres 

3 

2.8 

2.8 

Jupiter 

4 

5.2 

5.2 

Saturn 

5 

10.0 

9.2 

Uranus 

6 

19.6 

19.2 

Neptune 

7 

38.8 

30.1 

Pluto 

8 

77.2 

39.5 


- planets also rotate around their axes counter¬ 
clockwise (excluding Venus and Uranus); 

- planets have 99 % of the angular momentum 
of the solar system but only 0.15 % of the total 
mass; 

- terrestrial and giant planets exhibit physical 
and chemical differences; 

- relative abundances of ices and rocks as a func¬ 
tion of the distance from the Sun. 

Sometimes also the empirical Titius-Bode law 
is included (Table 7.4). It states that 

a = 0.4 + 0.3 x 2", 
n =-oo, 0,1,2,... (7.49) 

where the semimajor axis a is expressed in au. 

It is sometimes mentioned that the first scien¬ 
tific theory was the vortex theory by the French 
philosopher Rene Descartes in 1644; however it 
was concerned about the motion of the solar sys¬ 
tem bodies and not its origin. 

The first modern cosmogonical theories were 
introduced in the 18th century. One of the first 
cosmogonists was Immanuel Kant , who in 1755 
presented his nebular hypothesis. According to 
this theory, the solar system condensed from 
a large rotating nebula. Kant’s nebular hypoth¬ 
esis is surprisingly close to the basic ideas of 
modern cosmogonical models. In a similar vein, 
Pierre Simon de Laplace suggested in 1796 that 


the planets have formed from gas rings ejected 
from the equator of the collapsing Sun. 

The main difficulty of the nebular hypothesis 
was its inability to explain the distribution of an¬ 
gular momentum in the solar system. Although 
the planets represent less than 1 % of the total 
mass, they possess 98 % of the angular momen¬ 
tum. There appeared to be no way of achieving 
such an unequal distribution. A second objection 
to the nebular hypothesis was that it provided no 
mechanism to form planets from the postulated 
gas rings. 

Already in 1745, Georges Louis Leclerc de 
Buff on had proposed that the planets were formed 
from a vast outflow of solar material, ejected 
upon the impact of a large comet. Various catas¬ 
trophe theories were popular in the 19th century 
and in the first decades of the 20th century when 
the cometary impact was replaced by a close en¬ 
counter with another star. The theory was devel¬ 
oped, e.g. by Forest R. Moulton (1905) and James 
Jeans (1917). 

Strong tidal forces during the closest approach 
would tear some gas out of the Sun; this mate¬ 
rial would later accrete into planets. Such a close 
encounter would be an extremely rare event. As¬ 
suming a typical star density of 0.15 stars per 
cubic parsec and an average relative velocity of 
20 km/s, only a few encounters would have taken 
place in the whole Galaxy during the last 510 9 
years. The solar system could be a unique speci¬ 
men. This is clearly against modern observations 
(Chap. 22). 

The main objection to the collision theory is 
that most of the hot material torn off the Sun 
would be thrown out to space, rather than remain¬ 
ing in orbit around the Sun. There also was no 
obvious way how the material could form a plan¬ 
etary system. 

In the face of the dynamical and statistical dif¬ 
ficulties of the collision theory, the nebular hy¬ 
pothesis was revised and modified in the 1940’s. 
In particular, it became clear that magnetic forces 
and gas outflow could efficiently transfer angular 
momentum from the Sun to the planetary nebula. 
The main principles of planetary formation are 
now thought to be reasonably well understood. 

The oldest rocks found on the Earth are about 
3.7 x 10 9 years old; some lunar and meteorite 
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Fig. 7.24 Hubble Space 
Telescope images of four 
protoplanetary disks, 
“proplyds ", around young 
stars in the Orion nebula. 
The disk diameters are two 
to eight times the diameter 
of our solar system. There 
is a T Tauri star in the 
centre of each disk. (Mark 
McCaugh- 

rean/Max-Planck-Institute 
for Astronomy, C. Robert 
O’Dell/Rice University, 
and NASA) 



samples are somewhat older. When all the facts 
are put together, it can be estimated that the Earth 
and other planets were formed about 4.56 x 10 9 
years ago. On the other hand, the age of the 
Galaxy is at least twice as high, so the overall 
conditions have not changed significantly during 
the lifetime of the solar system. Moreover, there 
is even direct evidence nowadays, such as other 
planetary systems and protoplanetary disks, pro- 
plyds (Fig. 7.24). 

The Sun and practically the whole solar sys¬ 
tem simultaneously condensed from a rotating 
collapsing cloud of dust and gas, the density of 
which was some 10,000 atoms or molecules per 
cm 3 and the temperature 10-50 K. The elements 
heavier than helium were formed in the interiors 
of stars of preceding generations, as will be ex¬ 
plained in Sect. 12.8. The collapse of the cloud 
was initiated perhaps by a shock wave emanating 
from a nearby supernova explosion. 

The original mass of the cloud must be thou¬ 
sands of Solar masses to exceed the Jeans mass. 


When the cloud contracts the Jeans mass de¬ 
creases. Cloud fragments and each fragment con¬ 
tract independently as explained in later chapters 
of star formation. One of the fragments became 
the Sun. 

When the fragment continued its collapse, par¬ 
ticles inside the cloud collided with each other. 
Rotation of the cloud allowed the particles to sink 
toward the same plane, perpendicular to the rota¬ 
tion axis of the cloud, but prevented them from 
moving toward the axis. This explains why the 
planetary orbits are in the same plane. 

The mass of the proto-Sun was larger than 
the mass of the modern Sun. The flat disk in the 
plane of the ecliptic contained perhaps 1/10 of 
the total mass. Moreover, far outside, the rem¬ 
nants of the outer edges of the original cloud were 
still moving toward the centre. The Sun was los¬ 
ing its angular momentum to the surrounding gas 
by means of the magnetic field. When nuclear 
reactions were ignited, a strong solar wind car¬ 
ried away more angular momentum from the Sun. 
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The final result was the modern, slowly rotating 
Sun. 

Gravitational and viscous torques transferred 
the angular momentum outwards. The former 
means a density wave caused by the instability of 
the disk, transferring bot mass and angular mo¬ 
mentum outwards. Collisions between dust parti¬ 
cles increased the velocities of outer particles and 
slowed down inner particles. Thus most particles 
moved inwards but the angular momentum out¬ 
wards and the disk spread out. 

Later, when nuclear reactions started, the 
strong solar wind transferred more angular mo¬ 
mentum. At this T Tauri stage the protosun lost 
mass as much as 10 -8 Mq /a in the form of solar 
wind. 

Collisions of the disk particles continued. Ini¬ 
tially individual particles stick together because 
of the weak intermolecular van der Waals forces. 
In less than 10,000 years the particle size in¬ 
creased from a few micrometres to millimetres. 
The growth rate was then proportional to the 
cross sections of the particles. 

When the particles became bigger the growth 
rate increased considerably and became propor¬ 
tional to the fourth power of the particle radius. 
The reason for this was that the weak gravitation 
of bigger particles started attract gas and dust. If 
the mass of a particle is M and radius R and the 
relative velocity of a dust particle Vo (Fig. 7.25) 
the effective cross section of collisions to the big¬ 
ger particle is s 2 : 


s 


2 



2 GMR\ 

Vo )' 


(7.50) 


Since M oc R 2 we have s 2 oc R 4 . 

The velocity of the gas was about 0.5 % 
smaller than the orbital velocity, and thus parti¬ 
cles moved faster than gas and swept away the 
gas and dust. This resulted in rapidly growing 
planetesimals , with diameters from a few metres 
to kilometres. 

Since big particles were moving faster than 
the gas they experienced a small friction slow¬ 
ing their velocity. The effect was strongest on me¬ 
tre size particles. Thus small planetesimals had to 
grow bigger in a few thousand years or drift down 
to the Sun. 


Fig. 7.25 If a particle 
passes a massive object at a 
close distance it will hit the 
larger body and increase its 
mass 



When the planetesimals collided (Fig. 7.26) 
they grew bigger but the growth rate was no more 
proportional to the fourth power of the radius but 
slower. When the planetesimals reached the size 
of planets their mutual gravitation became in¬ 
creasingly important. Collisions of planetesimals 
and protoplanets shaped the solar system until it 
to some extent looked like the current system. 
The formation of the Moon, the slow retrograde 
of Venus and the abnormal orientation of the ro¬ 
tation axis of Uranus were caused by collisions of 
objects of the size of Mars. 

The formation of lupiter and Saturn took 
about 10 3 —10 6 years, terrestrial planets 10 6 -10 7 
years, Uranus and Neptune 10 7 —10 s years. The 
Nice model (Sect. 7.12) suggests that originally 
Neptune was closer to the Sun than Uranus. 
Resonances caused Saturn, Uranus and Neptune 
to drift farther from the Sun, whence Neptune 
moved outside Uranus. Jupiter, on the other hand, 
moved closer to the Sun. 

The strong perturbations by Jupiter prevented 
the formation of a large planet between Mars and 
Jupiter. The objects in this asteroid belt are either 
planetesimals or shattered protoplanets. 

Depending on the volatility the matter of the 
solar system can be divided roughly into three 
categories: Gases, mainly hydrogen and helium, 
consisting of about 98.2 % of the total mass of 
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Fig. 7.26 A schematic 
plot on the formation of the 
solar system, (a) A large 
rotating cloud, the mass of 
which was 3-4 solar 
masses, began to condense, 
(b) The innermost part 
condensed most rapidly 
and a disk of gas and dust 
formed around the 
proto-sun. (c) Dust 
particles in the disk 
collided with each other 
forming larger particles 
and sinking rapidly to 
a single plane, (d) Particles 
clumped together into 
planetesimals which were 
of the size of present 
asteroids, (e) These clumps 
drifted together, forming 
planet-size bodies which 
began (f) to collect gas and 
dust from the surrounding 
cloud, (g) The strong solar 
wind “blew” away extra 
gas and dust; the planet 
formation was finished 
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Distance from the Sun [AU1 
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Fig. 7.27 Temperature distribution in the solar system 
during planet formation. The present chemical composi¬ 
tion of the planets reflects this temperature distribution. 
The approximate condensing temperatures of some com¬ 
pounds have been indicated 

the solar system and remaining gaseous until very 
close to the absolute zero. Ices , about 1.4%, melt¬ 
ing around 160 K at the pressure of the initial 
nebula. Rocks , about 0.4 %, melting over temper¬ 
atures exceeding 1000 K (Fig. 7.27). 

Planets from Mercury to Mars consist mainly 
of rocks. When they were born the temperature 
in that region was too high for gases and ices 
to remain bound to planets. In this region over 
99 % of the matter remained outside the plan¬ 
ets. The temperature distribution is seen in the 
chemical contents of the planets. At the distance 
of Mercury the temperature had decreased below 
1400 K, which meant that compounds of iron and 
nickel could condense from the nebula. In fact, 
they form about 60 % of the mass of Mercury. 
When we move outwards other elements become 
more abundant. At the distance of the Earth the 
temperature is about 600 K and near Mars only 
450 K. The mantle of the Earth contains about 
10 % of iron(II)oxide FeO. In Mars there is con¬ 
siderably more FeO, but in Mercury hardly any¬ 
thing at all. 

Table 7.5 gives the mass distribution of the 
solar system and Table 7.6 the minimum mass 
needed for the existing planets. This takes into ac¬ 
count the different composition of the planets and 
the Sun. In reality, the mass of the accretion disk 
must be much bigger, since not all of the mass did 
not end up in planets. 

Using the minimum mass we can also calcu¬ 
late the required density distribution of the ac¬ 


Table 7.5 Mass distribution of the solar system 



Part of the (%) total mass 

Sun 

99.80 

Jupiter 

0.10 

Comets 

0.05 

Other planets 

0.04 

Moons and rings 

0.00005 

Asteroids 

0.000002 

Dust 

0.0000001 


cretion disk. If the mass of the planet is M and 
it has accreted its material in the distance range 
(I/O, n]) from a disk whose density is p(r), we 
get 

/ r2n nr \ 

p(r)dA = I I p(r)rdrd6 

Jo J r$ 

= 2jt ( * p(r)rdr. (7.51) 

Jr 0 

The density profile of the disk seems to obey 
a r~ 2 law pretty well except in the asteroid belt, 
where these is a clear mass defect (Fig. 7.28). 

At the distance of Jupiter and Saturn the tem¬ 
perature was already so low that icy bodies could 
form. Some satellites of Saturn are examples of 
such bodies. From the surrounding cloud the gi¬ 
ant planets collected gas that could stay around 
the planets because they were relatively far from 
the Sun. lupiter and Saturn contain mostly hy¬ 
drogen and helium. In Uranus and Neptune the 
content of these gases is smaller, possibly around 
twenty percent. 

Continuous collisions of meteoroids, shrink¬ 
ing of the planets under their own gravity, and 
radioactive decay of relatively short lived nu¬ 
clei produced a lot of heat. Heating caused par¬ 
tial melting of planets, leading to differentiation: 
heavier elements sank down and lighter ones rose 
towards the surface. 

The bombardment continued for about half a 
billion years. Its effects are still seen on most 
solid bodies. For instance, the Lunar maria are 
remnants of that era. On the Earth the tectonic 
resurfacing and erosion have destroyed most me¬ 
teorite craters. 
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Table 7.6 Minimum mass of the primordial nebula sition consistent with the Sun. The Nice model will change 

needed for the planets. The factor is a value by which the the values of this table and Fig. 7.28 

mass of the planet has to be multiplied to make the compo- 



Distance [au] 

Mass Earth = 1 

Factor 

Total mass 

Cumulative mass 

Mercury 

0.4 

0.055 

350 

19.3 

19 

Venus 

0.7 

0.815 

270 

220.1 

239 

Earth + Moon 

1.0 

1.012 

235 

237.8 

477 

Mars 

1.5 

0.107 

235 

25.1 

502 

Asteroids 

2.8 

0.002 

200 

0.4 

503 

Jupiter 

5.2 

317.89 

5 

1589.5 

2092 

Saturn 

9.6 

95.17 

8 

761.4 

2853 

Uranus 

19.2 

14.56 

15 

218.4 

3072 

Neptune 

30.1 

17.24 

20 

344.8 

3417 

Pluto 

40 

0.005 

70 

0.4 

3417 


Fig. 7.28 Surface density 
[kg/rn 2 ] of the accretion 
disc as a function of 
distance. The density 
follows approximately an 
r~ 2 law. Especially around 
the asteroid belt there 
seems to be a mass deficit, 
indicating that a 
considerable amount of 
matter has been removed 
elsewhere. The vertical 
lines separate regions from 
which each planet has 
accreted its material 



Due to the perturbations by large planets the 
“leftover” planetesimals collided to planets or 
were thrown out to the outskirts of the solar sys¬ 
tem or even out to the interstellar space. What re¬ 
mained where mainly the asteroids currently on 
stable orbits. Lots of low-density objects, comets, 
were thrown to the outer regions of the solar sys¬ 
tem. These form the current Oort cloud. The total 
mass of the Oort cloud may be even 40 M© and 
it may contain billions of comets. 

Also the small bodies beyond the orbit of Nep¬ 
tune and the somewhat more distant Kuiper belt 
may have originated nearer to the Sun. 

Planetary formation ended when the nuclear 
reactions of the Sun started and the Sun entered 
its T Tauri stage (Sect. 14.3). The strong solar 
wind caused the Sun to lose mass and angular 
momentum. The mass loss was about 10 -6 Mq 


a year, yet altogether maybe less than 0.1 Mq. 
The solar wind blew away the gas dust still in the 
interplanetary space, and thus the planets could 
not accrete any more matter. 

The solar wind or radiation pressure has no 
effect on millimetre- and centimetre-sized parti¬ 
cles. However, they will drift into the Sun because 
of the Poynting-Robertson effect, first introduced 
by John P. Poynting in 1903. Later H.P. Robertson 
derived the effect by using the theory of relativity. 
When a small body absorbs and emits radiation, it 
loses its orbital angular momentum and the body 
spirals to the Sun. At the distance of the asteroid 
belt, this process takes only a million years or so. 
Therefore the meteors wee see nowadays must be 
much younger than the solar system. A relatively 
big fraction of them is material disrupted from 
comets. 
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7.12 Nice Models 

It has been assumed that the distances of the plan¬ 
ets have not changed much since they were born. 
The most essential feature of the Nice model is 
that the giant planets were born much closer to 
the Sun. The model is based on a large number 
of computer simulations, carried out in the Cote 
d’Azur observatory near Nice. 

The planets drifted to their current orbits due 
to their mutual gravity and resonances. At the 
same time planetary perturbations made the left¬ 
over material either to collide with planets or 
move outwards to the asteroid belt, Kuiper belt 
and Oort cloud. 

The original Nice model could not properly 
explain all properties of the solar system, like the 
structure of the Kuiper belt. These problems were 
fixed in the newer Nice model 2. 

There are several versions of the models with 
varying details. They seem to approach the cor¬ 
rect explanation, it may still be too early to dis¬ 
cuss here their rather complicated details before 
they have been confirmed more convincingly. 


Box 7.1 (Tides) Let the tide generating body, 
the mass of which is M to be at point Q at a dis¬ 
tance cl from the centre of the Earth. The poten¬ 
tial V at the point A caused by the body Q is 


V(A) = 


GM 


( 1 ) 


where s is the distance of the point A from the 
body Q. 


/ r s\ 

S 


f o j- 

d 

Q 


Applying the cosine law in the triangle 
OAQ, the distance .v can be expressed in terms 
of the other sides and the angle z = AO Q 

s 2 = dr + r 2 — 2dr cos z, 

where r is the distance of the point A from the 
centre of the Earth. We can now rewrite ( 1 ) 


V (A) = 


GM 


V d 2 +r 2 —2 drcosz 


( 2 ) 


When the denominator is expanded into a Tay¬ 
lor series 


a+*r* 


1 -x H —x 


where 

9 

r r 

x = -pr — 2- cos z 
d 

and ignoring all terms higher than or equal to 
1 / d 4 one obtains 


V (A) = 


GM GM 


d d 2 
GMr 2 1 


-r cosz 


d 3 


-(3cos 2 z-l). (3) 


The gradient of the potential V(A) gives 
a force vector per mass unit. The first term of 
(3) vanishes, and the second term is a constant 
and independent of r. It represents the central 
motion. The third term of the force vector, how¬ 
ever, depends on r. It is the main term of the 
tidal force. As one can see, it depends inversely 
on the third power of the distance d. The tidal 
forces are diminished very rapidly when the 
distance of a body increases. Therefore the tidal 
force caused by the Sun is less than half of that 
of the Moon in spite of much greater mass of 
the Sun. 

We may rewrite the third term of (3) as 


V 2 = 2D 


^cos 2 z 



(4) 


where 



is called Doodson’s tidal constant. It’s value 
for the Moon is 2.628 m 2 s -2 and for the Sun 
1.208 m 2 s -2 . We can approximate that z is the 
zenith angle of the body. The zenith angle z can 
be expressed in terms of the hour angle h and 
declination 5 of the body and the latitude 0 of 
the observer 


cos z = cos h cos <5 cos 0 + sin S sin 0. 
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Inserting this into (4) we obtain after a lengthy 
algebraic operation 

V 2 = D ^cos 2 0 cos 2 <5 cos 2h 
+ sin 20 cos 28 cos h 
± (3 sin 2 0 — l) ^sin 2 8 — 

= D(S + T + Z). (5) 

Equation (5) is the traditional basic equation of 
the tidal potential, the Laplace’s tidal equation. 

In (5) one can directly see several charac¬ 
teristics of tides. The term S causes the semi¬ 
diurnal tide because it depends on cos 2li. It has 
two daily maxima and minima, separated by 12 
hours, exactly as one can obtain in following 
the ebb and flood. It reaches its maximum at 
the equator and is zero at the poles (cos 2 0). 

The term T expresses the diurnal tides 
(cos h ). It has its maximum at the latitude ±45° 
and is zero at the equator and at the poles 
(sin 20). The third term Z is independent of the 
rotation of the Earth. It causes the long period 
tides, the period of which is half the orbital pe¬ 
riod of the body (about 14 days in the case of 
the Moon and 6 months for the Sun). It is zero 
at the latitude ±35.27° and has its maximum 
at the poles. Moreover, the time average of Z 
is non-zero, causing a permanent deformation 
of the Earth. This is called the permanent tide. 
It slightly increases the flattening of the Earth 
and it is inseparable from the flattening due to 
the rotation. 

The total value of the tidal potential can be 
computed simply adding the potentials caused 
by the Moon and the Sun. Due to the tidal 
forces, the whole body of the Earth is de¬ 
formed. The vertical motion A r of the crust can 
be computed from 

V 2 

A r — h — <=» 0.06 V 2 [m], (6) 

8 

where g is the mean free fall acceleration, g *=» 
9.81 ms -2 and h is a dimensionless number, 
the Love number, li ~ 0.6, which describes the 


elasticity of the Earth. In the picture below, 
one can see the vertical motion of the crust in 
Helsinki, Finland (0 = 60°, A. = 25°) in Jan¬ 
uary 1995. The non-zero value of the temporal 
mean can already be seen in this picture. 



The tides have other consequences, too. Be¬ 
cause the Earth rotates faster than the Moon or¬ 
bits the Earth, the tidal bulge does not lie on 
the Moon-Earth line but is slightly ahead (in 
the direction of Earth’s rotation), see below. 



Due to the drag, the rotation of the Earth 
slows down by about 1-2 ms per century. The 
same reason has caused the Moon’s period 
of rotation to slow down to its orbital period 
and the Moon faces the same side towards the 
Earth. The misaligned bulge pulls the Moon 
forward. The acceleration causes the increase 
in the semimajor axis of the Moon, about 3 cm 
per year. 


7.13 Examples 

Example 7.1 (Sidereal and Synodic Period) The 
time interval between two successive oppositions 
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of Mars is 779.9 d. Calculate the semimajor axis 
of Mars’ orbit. 

The synodic period is 779.9 d = 2.14 years. 
We obtain from (7.2) 

where P — 1 d = 24 h. The total energy is thus 

fho 

W = I So(sin<5 sin0 + cos 5 cos0 cosh) 
J-ho 

^ = 1-214= °' 53 => P2=L88a ' 

p 

x — dli 

2 n 

s 0 p 

= - (ho sin 8 sin0 + cos S cos 0 sin ho), 

By using Kepler’s third law (m <C Mq), the semi¬ 
major axis is found to be 

TV 

where 

a = P 2 ' 3 = 1.88 2/3 = 1.52 au. 

ho = arccos(— tan <5 tan 0). 

Example 7.2 (Solar Energy Flux on the Earth) 
Calculate the diurnal solar energy flux per unit 
area at the distance of the Earth. 

The solar flux density outside the Earth’s at¬ 
mosphere (the solar constant) is So = 1370 W/m 2 . 
Consider a situation at latitude 0, when the solar 
declination is 5. If the atmospheric extinction is 
neglected, the flux density on the surface is 

For example near the equator (0 = 0°) cos ho = 0 
and 

W(0 = O°) = — cos 5. 

TV 

At those latitudes where the Sun will not set, ho = 
tv and 

Wcirc = S , o7’sin 1 5sin0. 

Near the poles, the Sun is always circumpolar 
when above the horizon, and so 

S = So sinu, 

W(<p — 90°) = SoP sin<5. 

where a is the elevation of the Sun. We can write 
sin a as a function of latitude, declination, and 
hour angle h: 

Interestingly enough, during the summer when 
the declination of the Sun is large, the polar ar¬ 
eas receive more energy than the areas close to 
the equator. This is true when 

sin a = sin 8 sin 0 + cos S cos 0 cos h. 

W(if> = 90°) > W(0 = 0°) 

On a cloudless day, the energy is received be¬ 
tween sunrise and sunset. The corresponding 
hour angles can be obtained from the equation 
above, when a = 0: 

O SqP sin5 > SqPcosS/tt 

4^ tan S >\/tv 

4> & > 17.7°. 

cos/io = — tan <5 tan 0. 

The declination of the Sun is greater than this 
about two months every summer. 

In the course of one day, the energy received on 
a unit area is 

However, atmospheric extinction diminishes 
these values, and the loss is at its greatest at the 
poles, where the elevation of the Sun is always 

fh o 

W= / Sdr. 

J-ho 

relatively small. Radiation must penetrate thick 
layers of the atmosphere and the path length is 
comparable to 1/sina. If it is assumed that the 

The hour angle h is expressed in radians, so the 
time t is 

fraction k of the flux density reaches the surface 
when the Sun is at zenith, the flux density when 
the Sun is at the elevation a is 

h 

t = — P, 

2jt 

S' = S 0 smak 1/sina . 
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The total energy received during one day is thus 

/ ho fho 

S'dt = I So sina i 1 / 81 ”" dr. 

-ho J -ho 

This cannot be solved in a closed form and nu¬ 
merical methods must be used. 

The figure on next page shows the daily re¬ 
ceived energy W [kW h/m 2 ] during a year at lat¬ 
itudes 0 = 0°, 60°, and 90° without extinction, 
and when k = 0.8, which is close to the real value. 


Example 7.3 (Magnitude of a Planet) The ap¬ 
parent magnitude of Mars during the 1975 op¬ 
position was m\ = —1.6 and the distance to the 
Sun, r\ = 1.55 au. During the 1982 opposition, 
the distance was r 2 — 1.64 au. Calculate the ap¬ 
parent magnitude in the 1982 opposition. 

At opposition, the distance of Mars from the 
Earth is A = r — 1. The observed flux density de¬ 
pends on the distances to the Earth and the Sun, 


F oc 


1 


' r 2 A 2 ' 

Using the magnitude formula (4.9) we obtain 

r 2 (r 2 - l) 2 

m \ - m 2 = -2.5 lg \ -— 

r{(r\ - 1 Y 

n (r 2 — 1) 

=> m 2 = m\ + 51g —--— 

n(n - 1 ) 


= - 1 . 6 - 


1.64x0.64 

- 5 lg- 

5 1.55 x 0.55 


The same result is obtained if (7.38) is separately 
written for both oppositions. 


Example 7.4 (The Brightness of Venus) Find 
the instant when Venus is brightest if the bright¬ 
ness is proportional to the projected size of the 
illuminated surface. The orbits are assumed to be 
circular. 

The size of the illuminated surface is the area 
of the semicircle ACE± half the area of the el¬ 
lipse ABCD. The semiaxes of the ellipse are 
R and R cos a. If the radius of the planet is R, 
the illuminated area is 


R 1 


7T 


7 T — + -jtR x R cos a = — R (1 + cos a), 


where a is the phase angle. The flux density is 
inversely proportional to the square of the dis¬ 
tance A. Thus 


F oc 


1 + cos a 
A 2 


A 




The cosine formula yields 

= r 2 + A 2 — 2 Ar cos a. 

When cos a is solved and inserted in the flux den¬ 
sity we obtain 


2A r + r 1 + A 2 - M 2 

F oc- 5 - 

2r A 3 

The minimum of the equation yields the distance 
where Venus is brightest: 

3 F _ 4rA + 3r 2 -3M 2 + A 2 _ n 
3A ~ 2rA 4 ~ 

=>• A = -2 r± Jr 2 + 3Ml. 

If r — 0.723 au and R © = 1 au, the distance is 
A = 0.43 au and the corresponding phase angle 
is a = 118°. 
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Thus Venus is brightest shortly after the largest If T — 273 K, we the distance is r = 1.4 au; when 
eastern elongation and before the largest western T = 373 K, we get r = 0.8 au. 
elongation. From the sine formula we obtain 


sine sin a 


r M© 

The corresponding elongation is e = 40°, and 
1 + cos a 

- x 100 % = 27 % 


of the surface is seen lit. 


7.14 Exercises 

Exercise 7.1 What is the greatest possible elon¬ 
gation of Mercury, Venus and Mars? How long 
before sunrise or after sunset is the planet visible? 
Assume that the declination of the planet and the 
Sun is 8 — 0°. 


Example 7.5 (Magnitude of an Asteroid) The 
parameters of the asteroid 44 Nysa are H — 
6.929, G\ — 0.050, G 2 = 0.67 and the semima¬ 
jor axis of the orbit a = 2.42 au. Find V(l, 1°). 
What is the apparent magnitude at phase angles 
0° and 1°? 

Values of the basis functions are <Ji(l°) = 
0.9667, <Z> 2 (1°) = 0.9900, tf> 3 (l°) = 0.577. Us¬ 
ing these we get the absolute magnitude at the 
phase angle 1 0 : 

V(l, 1°) = 6.929- 2.5lg[0.050 x 0.9667 
+ 0.67 x 0.990 
+ (1 -0.050-0.67) x 0.577] 

= 7.076. (7.52) 


Exercise 7.2 (a) What is the greatest possible 
geocentric latitude of Venus, i.e. how far from the 
Sun can the planet be at the inferior conjunction? 
Assume the orbits are circular. 

(b) When is the situation possible? The longi¬ 
tude of the ascending node of Venus is 77°. 

Exercise 7.3 (a) Find the daily retrograde appar¬ 
ent motion of an exterior planet at its opposition. 
Assume that the planet and the Earth have circu¬ 
lar orbits. 

(b) Pluto was found in 1930 from two plates, 
exposed 6 days apart during the opposition of the 
planet. On those plates one degree corresponded 
to 3 cm. How much (in cm) had Pluto moved be¬ 
tween the exposures? How much does a typical 
main belt asteroid move in the same time? 


Near the opposition we can approximate A = 
r — 1=1.42 au. The apparent magnitude at the 
opposition is then 

m = 6.929 + 5 lg( 1.42 x 2.42) - 2.5 lg<P(0°) 

= 6.929 + 5 lg 3.36 = 9.561. (7.53) 

When the phase angle is 1° the absolute magni¬ 
tude is 0.147 greater than at the opposition. The 
apparent magnitudes differ by the same amount; 
hence m(a = 1°) = 9.71. 


Example 7.6 Find the distance of a comet from 
the Sun when its temperature reaches 0 °C and 
100 °C. Assume the Bond albedo of the comet 
is 0.05. 

Solve r from (7.47): 


r — 



1/2 

Rq. 


Exercise 7.4 A planet is observed at the oppo¬ 
sition or inferior conjunction. Due to the finite 
speed of light the apparent direction of the planet 
differs from the true place. Find this difference as 
a function of the radius of the orbit. You can as¬ 
sume the orbits are circular. Which planet has the 
largest deviation? 

Exercise 7.5 The angular diameter of the Moon 
is 0.5°. The full moon has an apparent magnitude 
of —12.5 and the Sun —26.7. Find the geomet¬ 
ric and Bond albedos of the Moon, assuming that 
the reflected light is isotropic (into a solid angle 
2 jt sterad). 

Exercise 7.6 The eccentricity of the orbit of 
Mercury is 0.206. How much does the apparent 
magnitude of the Sun vary as seen from Mercury? 
How does the surface brightness of the Sun vary? 
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Exercise 7.7 An asteroid with a diameter of What do you think about the chances of finding 
100 m approaches the Earth at a velocity of the asteroid well in advance the crash? 

30 kms -1 . Find the apparent magnitude of the 

asteroid (a) one week, (b) one day before the col- Exercise 7.8 Find the centripetal acceleration at 
lision. Assume that the phase angle is a — 0° and the poles and on the equator of the Earth, 
the geometric albedo of the asteroid is p — 0.1. 
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Our solar system contains eights planets orbit¬ 
ing the Sun, dwarf planets, asteroids, comets and 
meteors, as well satellites orbiting planets and 
smaller ring particles. In this chapter the proper¬ 
ties of these different object are discussed. 


8.1 Mercury 

Mercury is the innermost planet of the solar sys¬ 
tem. Its diameter is 4800 km. Mercury is always 
found in the vicinity of the Sun; its maximum 
elongation is only 28°. Observations are diffi¬ 
cult because Mercury is always seen in a bright 
sky and close to the horizon. Mercury has phases 
like the Moon. When it is closest to the Earth 
in the inferior conjunction, the dark side of the 
planet is toward us. And when the whole illumi¬ 
nated hemisphere is towards the Earth Mercury 
is behind the Sun and farthest from us. A couple 
of times in a century Mercury transits the solar 
disk (Sect. 7.5). Observations during transits have 
shown that Mercury does not have an appreciable 
atmosphere. 

The first maps of Mercury were drawn at the 
end of the 19th century but the reality of the de¬ 
tails was not confirmed. As late as in the begin¬ 
ning of the 1960’s, it was believed that Mercury 
always turns the same side toward the Sun. How¬ 
ever, measurements of the thermal radio emission 
showed that the temperature of the night side is 
too high, about 100 K, instead of almost absolute 
zero. Finally, the rotation period was established 
by radar. 


Fig. 8.1 Length of day in Mercury. The positions of Mer¬ 
cury during the first revolution are shown outside the el¬ 
lipse. Upon returning to the aphelion, the planet has turned 
540° (1 ^ revolutions). After two full cycles the planet has 
rotated three times around its axis and the same side points 
toward the Sun. The length of the day is 176 d. longer than 
on any other planet 

One revolution around the Sun takes 88 days. 
The rotation period is two-thirds of this, 59 days. 
This means that every second time the planet is 
in, say, perihelion, the same hemisphere faces the 
Sun (Fig. 8.1). This kind of spin-orbit coupling 
can result from tidal forces exerted by a central 
body on an object moving in a fairly eccentric or¬ 
bit. 

Re-examination of old observations revealed 
why Mercury had been presumed to rotate syn¬ 
chronously. Owing to its geometry, Mercury is 
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Fig. 8.2 Mercury imaged by the Messenger probe. The 
centre of the upper mosaic image is at the intersection 
of the equator and meridian of the planet. The ray crater 
Debussy is near the lower edge. The mosaic image on 
the opposing page shows the opposite hemisphere. The 
circled area is the over 1500 km wide Caloris basin. 
Below it a more easily distinguished 225 km wide cir¬ 
cular depression is the Mozart basin. (NASA/Johns Hop- 

easiest to observe in spring and autumn. In six 
months. Mercury orbits twice around the Sun, ro¬ 
tating exactly three times around its own axis. 
Consequently, during observations, the same side 
was always facing the Sun! The details visible on 
the surface are very obscure and the few excep¬ 
tional observations were interpreted as observa¬ 
tional errors. 

Since the rotation period of Mercury is r* = 
58.6 d ans the orbital period P — 87.97 d, (2.43) 
shows that the length of day is r = 176 d, or two 
Mercury’s years. The rotation axis is almost per¬ 
pendicular to the orbital plane. 

The mean distance from the Sun is 0.39 au. 
The eccentricity of the orbit is 0.21, which means 


kins University Applied Physics Laboratory/Camegie In¬ 
stitution of Washington). Below: A detail of the surface 
photographed by Mariner 10 during it’s first encounter 
with Mercury in 1974. The scarp is about 350 kilome¬ 
tres long and transects two craters 35 and 55 kilometres 
in diameter. It is up to 2 km high in some places and it ap¬ 
pears to be a fault produced by compression of the crust. 
(NASA/JPL/Northwestem University) 

that the distance varies between 0.31 and 0.47 au. 
Because of the high eccentricity, the surface tem¬ 
perature of the subsolar point varies substantially: 
at the perihelion, the temperature is about 700 K; 
at the aphelion, it is 100 K lower. Temperature 
variations on Mercury are the most extreme in the 
solar system because in the night side the temper¬ 
ature drops below 100 K. 

The precession of the perihelion of Mercury 
is more than 0.15° per century. When the New¬ 
tonian perturbations are subtracted, there remains 
an excess of 43". This is fully explained by the 
general theory of relativity. The explanation of 
the perihelion precession was one of the first tests 
of the general theory of relativity. 
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Fig. 8.2 (Continued) 
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The first spacecraft studying Mercury was the 
US Mariner 10 that passed Mercury three times 
in 1974 and 1975. The orbital period of Mariner 
10 around the Sun was exactly twice the period 
of Mercury. The two-thirds-factor meant that the 
same side of the planet was illuminated during 
every fly-by! The other side remained unknown. 
It was only in 2004 that the Messenger (MErcury 
Surface, Space ENvironment, GEochemistry, and 
Ranging) mapped the whole surface of Mercury, 
first on three fly-bys in 2008-2009 and from an 
orbit around Mercury in 2011-2015. The next 
probe will be the BepiColombo built by the Eu¬ 
ropean Space Agency ESA. The launch is sched¬ 
uled for 2017, and after several fly-bys it should 
settle on an orbit around Mercury in 2024. 

Since Mercury has no satellites, precise values 
of its mass and density could be calculated only 
after Mariner’s flight. 

The Mariner 10 data revealed a moon-like 
landscape. Mercury’s surface is covered by im¬ 
pact craters (Fig. 8.2), indicating that the surface 
is old and undisturbed by continental drift or vol¬ 
canic eruptions. There are some signs of volcanic 
activity, but they are very old, possibly over a bil¬ 
lion years. 

There are also some circular formations re¬ 
sembling lunar maria, formed by impacts of 
larger object and filled by lava seeping from the 
interior of the planet. The largest circular area is 
the 1500 km wide Caloris Basin. The shock wave 
produced by the Caloris impact was focused to 
the antipodal point, breaking the crust into com¬ 
plex blocks in a large area, the diameter of which 
is hundreds of kilometers. This region is named 
the Weird Terrain. 

There are also faults that were possibly pro¬ 
duced by compression of the crust. The volume 
change probably was due to the cooling of the 
planet. 

Mercury’s relatively small size and proxim¬ 
ity to the Sun, resulting in low gravity and high 
temperature, are the reasons for its lack of atmo¬ 
sphere. There is a layer made up of atoms blasted 
off the surface by the solar wind. The tenuous “at¬ 
mosphere” is composed mainly of oxygen, nitro¬ 
gen, and helium. The atoms quickly escape into 
space and are constantly replenished. 


Due to the absence of an atmosphere, the tem¬ 
perature on Mercury drops very rapidly after sun¬ 
set. The rotational axis is almost perpendicular 
to the orbital plane; therefore it is possible that, 
close to the poles, there are areas where the tem¬ 
perature is permanently below the freezing point. 
Radar echos from the surface of Mercury show 
several anomalously reflective and highly depo¬ 
larised features at the north and south poles. 
Some of these areas can be addressed to the 
craters, the bottoms of which are permanently in 
shadow. One candidate of the radar-bright fea¬ 
tures is water ice that has survived in the perma¬ 
nent shadow. 

Existence of ice was confirmed by the Mes¬ 
senger probe. At the bottoms of some craters the 
temperature never exceeds 100 K. The ice is cov¬ 
ered by regolith that prevents the ice from subli¬ 
mating and evaporating to space. It has been esti¬ 
mated that the total amount of ice could be even 
1/1000 of the ice in the Antarctic. 

It has been said that Mercury looks like the 
Moon from the outside but is terrestrial from the 
inside. According to theoretical models, the in¬ 
ternal structure is similar to that of the Earth but 
the core is substantially larger. The density of the 
planet is about the same as that of the Earth, in¬ 
dicating that the size of the Fe-Ni core is roughly 
about 75 % of the planet’s radius. The thickness 
of the mantle is only 500-700 km and that of the 
crust 100-300 km. 

Due to the vicinity of the Sun, the temperature 
of the primeval nebula at the distance of Mercury 
was quite high during planetary formation. Thus 
the relative abundances of the volatile elements 
are smaller than on any other terrestrial planet. 

The Sun creates strong tides on Mercury, with 
main periods of 44 and 88 days. Measuring the 
tidal variation is difficult due to its slowness, but 
it has bee calculated that the vertical motion near 
the equator could be a couple of meters. (The 
crust of the Earth moves about 30 cm due to the 
lunar tides.) 

Mercury has a weak magnetic field, about 1 % 
as strong as that of the Earth. The presence of 
the magnetic field is unexpected because Mer¬ 
cury is much smaller than the Earth and it rotates 
slowly. According to the dynamo theory, a mag- 
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Fig. 8.3 The phases of Venus were discovered by Galileo Galilei in 1610. This drawing illustrates how the apparent 
size of Venus changes with phase. The planet is far behind the Sun when the illuminated side faces the Earth 


netic field is generated by flows in a liquid, elec¬ 
trically conducting core. The magnetic field can¬ 
not be a remnant from ancient times, since the 
internal temperature of the planet must have ex¬ 
ceeded the critical Curie point. Therefore, it must 
be assumed that a part of the core is molten. 
Possibly the deformations caused by tides and 
the friction releasing heat keep the core molten 
and maintain mass flows generating the magnetic 
field. 


8.2 Venus 

Venus is the brightest object in the sky, after the 
Sun and the Moon. Like Mercury, Venus can be 
seen only in the morning or in the evening sky. 
Sometimes it is possible to see Venus even if the 
Sun is above the horizon, if its exact position 
is known. In antiquity, Venus was thought to be 
two different planets, Hesperos and Phosphorus , 
evening star and morning star. 

The maximum elongation of Venus is about 
47°. Venus is a remarkable object when shining 
in the dark sky at its brightest, 35 days before or 
after the inferior conjunction, when one-third of 
the surface is seen lit (Fig. 8.3). 

At the inferior conjunction, the Earth-Venus 
distance is only 42 million km. The diameter 
of Venus is about 12,000 km, which means that 
the apparent diameter can be as large as one arc 
minute. Under favourable conditions it is even 
possible to see the shape of the crescent Venus 
with binoculars. At the superior conjunction, the 
apparent diameter is only 10 arc seconds. 

Venus is covered by clouds. Its surface is 
nowhere visible; only featureless yellowish cloud 
tops can be seen (Fig. 8.4). Therefore, even the 
rotation period was long unknown. 


The chemical composition of the Venusian 
atmosphere was known prior to the space age. 
Spectroscopic observations revealed CO 2 , but no 
oxygen. About 98 % of the atmosphere is car¬ 
bon dioxide. Some clues about the cloud compo¬ 
sition were obtained from polarimetric observa¬ 
tions. The famous French planetary astronomer 
Bernard Lyot made polarimetric observations in 
the 1920’s, but not until decades later was it re¬ 
alised that his observations could be explained by 
assuming that light was scattered by liquid spher¬ 
ical particles whose index of refraction is 1.44. 
This is significantly higher than the index of re¬ 
fraction of water, 1.33. Moreover, water is not 
liquid at that temperature. A good candidate was 
sulphuric acid FI 2 SO 4 . Later, spacecraft con¬ 
firmed this interpretation. 

In 1962, radar measurements revealed that the 
rotation period is 243 days in a retrograde direc¬ 
tion, i.e. opposite to the rotation of other planets. 
The axis of rotation is almost perpendicular to the 
orbital plane; the inclination is 177°. The reason 
for the strange rotation is not known. 

The temperature at the cloud tops is about 
250 K. Because the Bond albedo is as high 
as 75 %, the surface temperature was believed 
to be moderate, even suitable for life. Opinions 
changed dramatically when thermal radio emis¬ 
sion was measured at the end of the 1950’s. This 
emission originates on the surface of the planet 
and can penetrate the clouds. The surface tem¬ 
perature turned out to be 750 K, well above the 
melting point of lead. The reason for this is the 
greenhouse effect. The outgoing infrared radia¬ 
tion is blocked by atmospheric carbon dioxide. 
The pressure of the atmosphere at the surface is 
90 atm. 





186 


8 Objects of the Solar System 



Fig. 8.4 Left: Venus in visible light imaged by the Galileo 
orbiter in February 1990. The cloud features are caused 
by winds that blow from east to west at about 100 m/s. 
Right: The northern hemisphere of Venus in a Computer- 

Mariner 2 (1962) was the first spacecraft to 
encounter the planet. Five years later, the Soviet 
Venera 4 sent the first data from below the clouds, 
and the first pictures of the surface were sent by 
Venera 9 and 10 in 1975. The first radar map was 
completed in 1980, after 18 months of mapping 
by the US Pioneer Venus 1. The best and the 
most complete maps (about 98 % of the planet’s 
surface) were made using the synthetic aperture 
radar observations of the Magellan spacecraft in 
1990-1994. The resolution of the maps is as high 
as 100 m and the elevation was measured with 
a resolution of 30 metres. The Venus Express 
launched by ESA studied Venus from an orbit 
during 2006-2014. 

Radar mappings revealed canyons, mountains, 
craters, volcanoes and other volcanic formations 
(Fig. 8.5). The surface of Venus is covered by 
about 20 % of lowland plains, 70 % of gently 
rolling uplands and lava flows, and 10 % of high¬ 
lands. 

There are only few major highland areas. 
The largest continent, Aphrodite Terra , is close 
to the equator of Venus; its size is similar to 
South America. Another large continent at the 
latitude 70°N is called Ishtar Terra, where the 


generated picture of the radar observations. The north pole 
is at the centre of the image of the Magellan synthetic 
aperture radar mosaic. (NASA/JPL) 


highest mountain on Venus, the 12 km high 
Maxwell Montes is situated. (IAU has decided 
that the Venusian nomenclature has to be femi¬ 
nine. Maxwell Montes, after the famous physicist 
James Clerk Maxwell, is an exception.) 

On Venus there are volcanic features all over 
the surface. There is no evidence of massive tec¬ 
tonic movement although local deformations can 
exist. Small volcanoes are evenly distributed but 
big ones are concentrated mainly on highlands. 
The origin of the latter may resemble that of the 
Hawaiian volcanoes: the volcanoes are located 
above a “hot spot” of the mantle, in a place where 
currents of the mantle move hot magma towards 
the surface. 

Venus has more volcanoes than any other 
planet in the solar system. Over 1500 major vol¬ 
canoes or volcanic features are known, and there 
may even be one million smaller ones. Most are 
shield volcanoes, but there are also many com¬ 
plex features. None are known to be active at 
present, although large variations of sulphur diox¬ 
ide in the atmosphere may indicate that some vol¬ 
canoes are active. 

Almost all volcanism on Venus seems to in¬ 
volve fluid lava flows without any explosive erup- 
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Fig. 8.5 Surface features of Venus. (Top left): A Magel¬ 
lan image of a 50 km peak-ring crater Barton at 27.4°N 
and 337.5°E. (Top right): A Magellan radar image of a re¬ 
gion 300 km across, located in a vast plain to the south 
of Aphrodite Terra. The large circular structure near the 
centre of the image is a corona, approximately 200 km in 


diameter. North of the corona is a 35 km flat-topped vol¬ 
canic construct known as a pancake dome. Complex frac¬ 
ture patterns like in the upper right of the image are often 
observed in association with coronas and various volcanic 
features. (NASA/JPL). (Bottom): The surface of Venus 
photographed by the Venera 14 lander in March 1982 


tions. Due to the high air pressure, Venusian lavas 
need a much higher gas content than the Earth 
lavas to erupt explosively. The main gas driv¬ 
ing lava explosions on the Earth is water vapour, 
which does not exist on Venus. 

Flat-topped volcanic constructs known as pan¬ 
cake domes are probably formed by the eruption 
of an extremely viscous lava. A corona is a cir¬ 
cular trench surrounding an elevated plain, the 
diameter of which can be as big as several hun¬ 
dreds of kilometres. They are possibly examples 
of local hot spots, mantle upwellings that have 
expanded and formed bulges. When the flow has 
stopped, the bulge has sunk and formed a set of 
ring mountains. 

In other places fluid lava flows have produced 
long, sinuous channels extending for hundreds of 
kilometres. 

Venus has no large scale tectonic activity. The 
reason may be the thinner and weaker crust and 


lack of water; therefore there are no large subdue- 
tion zones between continents. Instead, the crust 
can consist of local zones only. Since the inter¬ 
nal heat cannot flow out like on the Earth, the 
crust will warm up until at a critical temperature 
it will collapse and be renewed in hundred mil¬ 
lion years. The last time this happened was about 
500 million years ago. 

Most of the Venusian impact craters are un¬ 
deformed. This indicates that the Venusian sur¬ 
face must be young because erosion, volcan- 
ism and tectonic forces should affect the craters, 
too. Resurfacing processes may frequently cover 
the old craters, and all craters visible are there¬ 
fore young, presumably less than 500 million 
years. There are no impact crates smaller than 
a few kilometers because smaller meteoroids are 
burned in the thick atmosphere. 

The Earth and Venus are almost equal in size, 
and their interiors are assumed to be similar. 
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Venus has an iron core about 3000 km in radius 
and a molten rocky mantle covering the majority 
of the planet. Probably due to its slow rotation, 
however, Venus has no magnetic field. The anal¬ 
yses made by the Venera landers have shown that 
the surface material of Venus is similar to terres¬ 
trial granite and basalt. 

About 1 % of the incident light reaches the 
surface of Venus; this light is deep red after trav¬ 
elling through clouds and the thick atmosphere. 
Most of the incident light, about 75 %, is reflected 
back from the upper layers of clouds. The ab¬ 
sorbed light is emitted back in infrared. The car¬ 
bon dioxide atmosphere very effectively prevents 
the infrared radiation from escaping, and the tem¬ 
perature had not reached the equilibrium until at 
750 K. 

In spite of the thick layer of clouds, the visibil¬ 
ity on the surface is several kilometres, and even 
in the clouds, a few hundred metres. Therefore, 
the clouds are relatively transparent. The densest 
clouds are at a height of 50 km, but their thick¬ 
ness is only 2-3 km. Above this, there are haze¬ 
like layers which form the visible “surface” of the 
planet. The uppermost clouds move rapidly; they 
rotate around the planet in about 4 days, pushed 
by strong winds powered by the Sun. The flow 
patterns of the clouds are best visible in ultravio¬ 
let light. The sulphuric acid droplets do not rain 
on the Venusian surface but they evaporate in the 
lower atmosphere before reaching the surface. 

Venus’ atmosphere is very dry: the amount of 
water vapour present is only 1/1,000,000 of that 
in the Earth’s atmosphere. One possible explana¬ 
tion is that, due to solar UV radiation, the water 
has dissociated to hydrogen and oxygen in the up¬ 
per layers of the atmosphere, the former escaping 
into interplanetary space. 

Venus has no satellites. 


8.3 The Earth and the Moon 

The third planet from the Sun, the Earth, and its 
satellite, the Moon, form almost a double planet. 
The relative size of the Moon is larger than that 
of any other satellite, excluding the moon of the 
dwarf planet Pluto. Usually satellites are much 
smaller than their parent planets. 
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Fig. 8.6 Internal structure of the Earth. The speed of 
the seismic waves, density, pressure, and temperature are 
shown as a function of depth. The crust, lithosphere, lower 
mantle and inner core are solid, the outer mantle is molten 
and the asthenosphere is partly molten 

The Earth is a unique body, since a consid¬ 
erable amount of free water is found on its sur¬ 
face. This is possible only because the temper¬ 
ature is above the freezing point and below the 
boiling point of water and the atmosphere is thick 
enough. The Earth is also the only planet where 
life is known to exist. (Whether it is intelligent or 
not is yet to be resolved...). The moderate tem¬ 
perature and the water are essential for terrestrial 
life, although some life forms can be found in ex¬ 
treme conditions. 

The diameter of the Earth is 12,000 km. At the 
centre, there is an iron-nickel core where the tem¬ 
perature is 5000 K, the pressure 3 x 10 11 Nm -2 
and the density 12,000 kg m“ 3 (Fig. 8.6). 

The core is divided into two layers, inner and 
outer core. The inner core, below 5150 km com¬ 
prises only of 1.7 % of the mass of the Earth. It is 
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solid because of high pressure. The nonexistence 
of the seismic transverse S waves below a depth 
of 2890 km indicates that the outer core is molten. 
However, the speed of the longitudinal P waves 
change rapidly at a depth of 5150 km showing an 
obvious phase transition. It has been discovered 
that the solid inner core rotates with respect to 
the outer core and mantle. 

The outer core comprises about 31 % of the 
mass of the Earth. It is a hot, electrically conduct¬ 
ing layer of liquid Fe-Ni where the convective 
motions take place. There are strong currents in 
the conductive layer that are responsible for the 
magnetic field. 

Between the outer core and the lower mantle 
there is a 200 km thick transition layer. Although 
this D" layer is often included as a part of the 
lower mantle, seismic discontinuities suggest that 
it might differ chemically from the lower mantle. 

A silicate mantle extends from 2890 km up¬ 
ward up to a depth of few tens of kilometres. The 
part below 650 km is often identified as the lower 
mantle. It contains about 49 % of the mass and 
is composed mainly of silicon, magnesium, and 
oxygen but some iron, calcium, and aluminium 
may also exist. The major minerals are olivine 
(Mg,Fe) 2 Si 04 and pyroxene (Mg,Fe)Si 03 . Un¬ 
der pressure the material behaves like a viscous 
liquid or an amorphous medium, resulting in slow 
vertical flows. 

Between the lower and upper mantle there is 
a 250 km thick transition region or mesosphere. It 
is the source of basaltic magmas and is rich in cal¬ 
cium and aluminium. The upper mantle, between 
some tens of kilometres down to 400 km contains 
about 10 % of the mass. Part of the upper man¬ 
tle, called the asthenosphere, might be partially 
molten. 

A thin crust floats on the mantle. The thick¬ 
ness of the crust is only 10-65 km; it is thick¬ 
est below high mountain ranges such as the 
Himalayas and thinnest below the mid-ocean 
basins. The seismic discontinuity showing the 
border between the crust and mantle was discov¬ 
ered in 1909 by the Croatian scientist Andrija 
Mohorovicic, and it is now known as the Moho 
discontinuity. 

The basaltic oceanic crust is very young, 
mostly less than 100 million years and nowhere 


more than 200 Ma. It is made through tectonic 
activity at the mid-ocean ridges, where new ma¬ 
terial is spreading out. 

The continental crust is mainly composed of 
crystalline rocks that are dominated by quartz 
(SiC> 2 ) and feldspars (metal-poor silicates). Be¬ 
cause the continental crust is lighter than the 
oceanic crust (average densities are about 
2700 kg m -3 and 3000 kg m -3 , respectively), the 
continents are floating on top of other layers, and 
currently they are neither created nor destroyed. 

The lithosphere is the rigid outer part of the 
Earth (crust and the topmost part of the upper 
mantle). Below that is the partially molten as¬ 
thenosphere where the damping of seismic waves 
is stronger than in the rigid lithosphere. 

The lithosphere is not a single rigid and seam¬ 
less layer; instead it is divided into more than 20 
individual plates. The plate tectonics (“continen¬ 
tal drift”) is powered by the motion of the ma¬ 
terial in the mantle (Fig. 8.7). New material is 
flowing up at the mid-ocean ridges, pushing the 
tectonic plates apart. New oceanic crust is gen¬ 
erated at the rate of 17 km 3 per year. The Earth 
is the only planet that shows any large-scale tec¬ 
tonic activity. The history of the motion can be 
studied by using e.g. the paleomagnetic data of 
magnetic orientation of crystallised rocks. 

At the end of the Precambrian era, about 700 
million years ago, more than half of the conti¬ 
nents were together forming the continent known 
as Gondwana, containing Africa, South Amer¬ 
ica, Australia and Antarctica. About 350 mil¬ 
lion years ago Gondwana was on the South Pole 
but it moved toward the equator before the final 
breakup. Mutual collisions formed new moun¬ 
tains and finally in the beginning of the Mesozoic 
era, about 200 million years ago, all the conti¬ 
nents were joined into one supercontinent, Pan¬ 
gaea. 

Quite soon the flow pattern in the mantle 
changed and the Pangaea broke up. The Atlantic 
Ocean is still growing and new material is flow¬ 
ing up at the mid-Atlantic ridge. North America 
is drifting away from Europe at the rate of a few 
centimetres per year (your fingernails are grow¬ 
ing at the same speed). At the same time, parts 
of the Pacific oceanic plate are disappearing be¬ 
low other plates. When an oceanic crust is pushed 
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Fig. 8.7 The tectonic plates. The dots on the map indicate 
the location of earthquakes with magnitudes greater than 5 
in the years 1980-1989. Arrows show the velocities ob- 

below a continental crust, a zone of active volca¬ 
noes is created. The earthquakes in the subduc- 
tion zones can even originate 600 km below the 
surface. In the mid-ocean ridges, the depth is only 
tens of kilometres. 

Mountains are formed when two plates col¬ 
lide, The push of the African plate toward the 
Eurasian plate formed the Alps about 45 million 
years ago. The collision of the Indian plate cre¬ 
ated the Himalayas some 40 million years ago, 
and they are still growing. The plates can also 
move sideways. Earthquakes in such a fault usu¬ 
ally occur in depths less than about 100 kilome¬ 
tres. An example is the San Andreas fault in Cal¬ 
ifornia. 

No other planet has such large scale tectonic 
activity. The interiors of Mars and Mercury are 
already too cold to maintain convection in their 
mantles. The structure of the crust of Venus is too 
weak to move large regions. Also, the free water 
of the Earth may be important in keeping up the 
plate tectonics. 

The climate is an efficient factor reshaping the 
surface. The mountains created by collisions of 
tectonic plates are eroded already in a few hun¬ 
dred million years by temperature changes, rain, 
wind and ice. Impact crater disappear even faster. 


served with permanent GPS (Global Positioning System) 
tracking stations. The velocity scale is shown at lower left 


and many of the currently known craters have 
been found by local gravitational anomalies in¬ 
stead of geographical features. 

Most of the Earth’s surface is covered with wa¬ 
ter. The Earth is the only planet that currently has 
free running water. The water condensed from the 
water vapour released in volcanic eruptions. De¬ 
pending on glaciation, the sea level ascends or 
descends by over 100 metres with respect to the 
solid ground. During the last ice age the sea level 
was 150 m below the current. Due to the melting 
of glaciers the sea level is rising by one millimetre 
per year, but the rate is increasing by the green¬ 
house effect. 

Compared with the geological time scale the 
increase in the greenhouse effect caused by hu¬ 
man activity is a very young phenomenon. Ice 
ages have come and gone in periods of a time 
scales of 100,000 years. The current warming of 
the climate is a temporary phenomenon, which, 
however, may have some effect on the beginning 
of the next ice age. Although the human activi¬ 
ties have little effect from the geological point of 
view, they may have disastrous effects on the cur¬ 
rent biosphere. 

The primordial atmosphere of the Earth was 
very different from the modern one; there was, for 
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Fig. 8.8 Hurricane Elena 
in the Gulf of Mexico seen 
from the space shuttle 
Discovery in September 1, 
1985. Wind speeds may 
exceed 170 km/h. Compare 
this to the Great Red Spot 
of Jupiter in Fig. 8.18. 
(NOAA) 



example, no oxygen. Earlier it was assumed that 
the original atmosphere was reducing, but more 
recent studies seem to indicate that it was neutral. 
When organic chemical processes started in the 
oceans more than 2 x 10 9 years ago and reached 
the level of photosynthesis, the amount of oxy¬ 
gen rapidly increased (and was poison to the 
first forms of life!). The original carbon dioxide 
is now mainly concentrated in carbonate rocks, 
such as limestone, and the methane was disso¬ 
ciated by solar UV radiation. The current atmo¬ 
sphere has very little resemblance to the early at¬ 
mosphere a couple of billion years ago. 

The Earth’s main atmospheric constituents are 
nitrogen (77 % by volume) and oxygen (21 %). 
Other gases, such as argon, carbon dioxide, and 
water vapour are present in minor amounts. The 
chemical composition is unchanged in the lower 
part of the atmosphere, called the troposphere. 
Most of the climatic phenomena occur in the tro¬ 
posphere, which reaches up to 8-10 km (Fig. 8.8). 
The height of the layer is variable, being lowest at 
the poles, and highest at the equator, where it can 
extend up to 18 km. 

The layer above the troposphere is the strato¬ 
sphere, extending up to 60 km. The boundary 


between the troposphere and the stratosphere 
is called the tropopause. In the troposphere, 
the temperature decreases 5-7 K/km, but in the 
stratosphere it begins to rise, due to the absorp¬ 
tion of solar radiation by carbon dioxide, wa¬ 
ter vapour and ozone. The ozone layer, which 
shields the Earth from the solar UV radiation, is 
at a height of 20-25 km. 

A total of 99 % of air is in the troposphere 
and stratosphere. The stratopause at a height of 
50-60 km separates the stratosphere from the 
mesosphere. 

The mesosphere extends up to 85 km. In this 
layer, the temperature decreases again, reaching 
the minimum of about —90 °C at the height of 
80-90 km in the mesopause. Chemicals in the 
mesosphere are mostly in an excited state, as they 
absorb energy from the Sun. 

Above the mesopause is the thermosphere that 
extends up to 500 kilometres. The temperatures 
increases with altitude and can be above 1200 °C 
at the height of 500 km. The gas is in the form of 
a fully ionised plasma. Therefore, the layer above 
the mesopause is also called the ionosphere. 

The density of air below a height of 150 km 
is high enough to cause colliding meteoroids to 
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Fig. 8.9 A map of the Lunar surface, composed of im¬ 
ages taken by the Clementine space probe in 1994. Note 
the large areas of maria in the Lunar near side, at the centre 

burn into ashes due to friction. It also plays an 
important role in radio communications, since ra¬ 
dio waves are reflected by the ionosphere. Auro¬ 
ras are phenomena of the upper part of the iono¬ 
sphere. 

The thermosphere goes over into the exo¬ 
sphere at about 500 km. There the air pressure is 
much lower than in the best laboratory vacuums. 

The magnetic field of the Earth is generated by 
flows in its core. The field is almost a dipole but 
there are considerable local and temporal varia¬ 
tions. The mean field strength close to the equa¬ 
tor is 3.1 x 10 -5 Tesla (0.31 Gauss). The dipole is 
tilted 11° with respect to the Earth’s axis, but the 
direction gradually changes with time. Currently 
the field is decreasing about 30 nT a year and 
moving counterclockwise; however, the angle be¬ 
tween the magnetic axis and the rotation axis re¬ 
mains nearly constant. Moreover, the magnetic 
north and south poles have exchanged places sev¬ 
eral times during the past, once every 100,000- 
1,000,000 years. The last time this happened was 
about 780,000 years ago. 

The Moon Our nearest neighbour in space is 
the Moon (Fig. 8.9). Its motion was described in 
Sect. 7.4. Dark and light areas are visible even 


of the figure, as compared to the almost complete absence 
of the maria at the Lunar far side. (US Naval Observatory) 


with the naked eye. For historical reasons, the for¬ 
mer are called seas or maria (from Latin, mare, 
sea, pi. maria). The lighter areas are uplands but 
the maria have nothing in common with terres¬ 
trial seas, since there is no water on the Moon. 
Numerous craters, all meteorite impacts, can be 
seen, even with binoculars or a small telescope. 
The lack of atmosphere, volcanism, and tectonic 
activity help to preserve these formations. 

The Moon is the best-known body after the 
Earth. The first man landed on the Moon in 
1969 during the Apollo 11 flight. A total of over 
2000 samples, weighing 382 kg, were collected 
during the six Apollo flights (Fig. 8.10). More¬ 
over, the unmanned Soviet Luna spacecraft col¬ 
lected and returned about 310 grams of Lunar 
soil. Instruments placed on the Moon by the 
Apollo astronauts operated as long as eight years. 
These included seismometers, which detected 
moonquakes and meteorite impacts, and passive 
laser reflectors which made exact Earth-Moon 
distance measurements possible. The reflectors 
are still used for Lunar laser ranging (LLR) mea¬ 
surements. 

Seismometric and gravimetric measurements 
have supplied basic information on the internal 
structure of the Moon. Moonquakes take place 










8.3 The Earth and the Moon 


193 


Fig. 8.10 Apollo 17 
astronaut Harrison Schmitt 
on the Moon in 1972. 
(NASA) 



at a depth of 800-1000 km, considerably deeper 
than earthquakes, and they are also much weaker 
than on the Earth. Most of the quakes occur at 
the boundary of the solid mantle, the lithosphere, 
and the asthenosphere (Fig. 8.11). The transver¬ 
sal S waves cannot penetrate the asthenosphere, 
indicating that it is at least partially molten. Tidal 
forces may generate at least some of the moon- 
quakes because most of them occur close to 
perigee or apogee. 

Lunar orbiters have observed local mass con¬ 
centrations, mascons, beneath the maria. These 
are large basaltic blocks, formed after the huge 
impacts which produced the maria. The craters 
were filled by lava flows during the next billion 
years or so in several phases. This can be seen, 
e.g. in the area of Mare Imbrium. Large maria 
were formed about 4 x 10 9 years ago when mete¬ 
orite bombardment was much heavier than today. 
The last 3 x 10 9 years have been quite peaceful, 
without any major events. 

The centre of mass is not at the geometric cen¬ 
tre of the Moon but about 2.5 km away due to 
the 20-30 km thick basaltic plates below the large 
maria. Moreover, the thickness of the crust varies, 
being the thickest at the far side of the Moon, 



Fig. 8.11 Structure of the Moon. The height differences 
of the surface are strongly exaggerated 


about 100 km. On the near side the thickness of 
the crust is about 60 km. 

The mean density of the Moon is 3400 kg m , 
which is comparable to that of basaltic lavas on 
the Earth. The Moon is covered with a layer of 
soil with scattered rocks, regolith. It consists of 
the debris blasted out by meteorite impacts. The 
original surface is nowhere visible. The thickness 
of the regolith is estimated to be at least tens of 
metres. A special type of rock, breccia, which 
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is a fragment of different rocks compacted and 
welded together by meteor impacts, is found ev¬ 
erywhere on the Moon. 

The maria are mostly composed of dark ba¬ 
salts, which form from rapid cooling of massive 
lava flows. The highlands are largely composed 
of anorthosite, an igneous rock that forms when 
lava cools more slowly than in the case of basalts. 
This implies that the rocks of the maria and high¬ 
lands cooled at different rates from the molten 
state and were formed under different conditions. 

Data returned by the Lunar Prospector and 
Clementine spacecraft indicated that water ice is 
present at both the north and south lunar poles. 
Data indicates that there may be nearly pure wa¬ 
ter ice buried beneath the dry regolith. The ice is 
concentrated at the bottoms of deep valleys and 
craters that are in a permanent shadow where the 
temperature is below 100 K. 

The Moon has no global magnetic field. Some 
of the rocks have remanent magnetism indicat¬ 
ing a possible global magnetic field early in 
the Moon’s history. Without the atmosphere and 
magnetic field, the solar wind can reach the 
Moon’s surface directly. The ions from the solar 
wind have embedded in the regolith. Thus sam¬ 
ples returned by the Apollo missions proved valu¬ 
able in studies of the solar wind. 

The origin of the Moon is still uncertain; it has, 
however, not been torn off from the Earth at the 
Pacific Ocean, as is sometimes believed. The Pa¬ 
cific is less than 200 million years old and formed 
as a result of continental drift. Also, the chemical 
composition of the lunar soil is different from that 
of terrestrial material. 

Recently it was suggested that the Moon was 
formed in the early stages of the formation of the 
Earth, when a lot of protoplanet embryos were 
orbiting the Sun. An off-axis collision of a Mars- 
size body resulted in ejection of a large amount 
of debris, a part of which then accreted to form 
the Moon. Differences in chemical compositions 
of the modern Earth and the Moon can be ex¬ 
plained with the theory, as well as the orientation 
and evolution of the Moon’s orbit and the Earth’s 
relatively fast spin rate. 


Box 8.1 (Atmospheric Phenomena) The best- 
known atmospheric phenomenon is the rain¬ 
bow, which is due to the refraction of light 
from water droplets. The radius of the arc of 
the rainbow is about 41° and the width, 1.7°. 
The centre of the arc is opposite the Sun (or 
any other source of light). When the light is re¬ 
fracted inside a water droplet, it is divided into 
a spectrum, where the red colour is at the outer 
edge and blue, at the inner edge. Light can be 
reflected twice inside the droplet, resulting in 
a secondary rainbow outside the primary one. 
The colours of the secondary rainbow are in re¬ 
versed order and its radius is 52°. A rainbow 
caused by the Moon is usually very weak and 
colourless, since the human eye is incapable of 
resolving colours of a dim object. 

A halo results when the solar or lunar light 
is reflected from atmospheric ice crystals. The 
most common halo is a 22° arc or circle around 
the Sun or the Moon. Usually the halo is white, 
but occasionally even bright colours can be 
seen. Another common form is the side lobes 
which are at the same height as the Sun but 
at a distance of 22° from it. All other forms 
of halo are less common. The best “weather” 
for halos is when there are cirrostratus or citrus 
clouds or an icy fog in the sky. 

Noctilucent clouds are thin formations of 
cloud, at a height of approximately 80 km. The 
clouds contain particles, which are less than 
one micron in diameter, and become visible 
only when the Sun (which is below the hori¬ 
zon) illuminates the clouds. Most favourable 
conditions are at the northern latitudes during 
the summer nights when the Sun is only a few 
degrees below the horizon. 

The night sky is never absolutely dark. One 
reason (in addition to light pollution) is the air- 
glow or light emitted by excited atmospheric 
molecules. Most of the radiation is in the in¬ 
frared domain, but e.g. the forbidden line of 
oxygen at 558 nm, has also been detected. 

The same greenish oxygen line is clearly 
seen in auroras, which are formed at a height 
of 80-300 km. Auroras can be seen mainly 
from relatively high northern or southern lati- 
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Top: Halos are particularly common in Antarctic (photo Marko Riikonen). 
Bottom: Auroras (photo Pekka Parviainen) 


tudes because the Earth’s magnetic field forces 
charged particles, coming from the Sun, close 
toward the magnetic poles. Alaska and north¬ 
ern Scandinavia are the best places to observe 
auroras. Occasionally, auroras can be seen as 
far south as 40°. They are usually greenish or 
yellow-green, but red auroras have been ob¬ 
served, too. They most commonly appear as 
arcs, which are often dim and motionless, or as 
belts, which are more active and may contain 
rapidly varying vertical rays. 


Meteors (also called shooting stars although 
they have nothing to do with stars) are small 
grains of sand, a few micrograms or grams in 
weight, which hit the Earth’s atmosphere. Due 
to friction, the body heats up and starts to glow 
at a height of 100 km. Some 20-40 km lower, 
the whole grain has burnt to ashes. The dura¬ 
tion of a typical meteor is less than a second. 
The brightest meteors are called bolides (mag¬ 
nitude smaller than about —2). Even larger par- 
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Fig. 8.12 Two pictures of Mars, taken by the Mars 
Global Surveyor in June and July 2001. The view from 
June (left) shows the Tharsis volcanic region, Valles 
Marineris and the late winter south polar cap. The view 


tides may survive down to the Earth. Meteors 
are further discussed in Sect. 8.13. 


8.4 Mars 

Mars is the outermost of the terrestrial planets. 
Its diameter is only half of that of the Earth. Seen 
through a telescope, Mars seems to be a reddish 
disk with dark spots and white polar caps. The po¬ 
lar caps wax and wane with the Martian seasons, 
indicating that they are composed of ice. Darker 
areas were suspected to be vegetation. At the end 
of the 19th century, an Italian astronomer, Gio¬ 
vanni Schiaparelli claimed that there are canals 
on Mars. 

In the United States, the famous planetary 
astronomer Percival Lowell studied the canals 
and even published books on the topic. Martians 
were also very popular in science fiction litera¬ 
ture. Now the canals are known to be nonexis¬ 
tent, an optical illusion when the obscure details 
at the limit of visibility seem to form straight 
lines, canals. Finally, the first clear pictures by 
Mariner 4 in 1965 buried even the most optimistic 
hopes concerning life on Mars. Later spacecraft 
revealed more details of the planet. 


from July shows the same regions, but most of the details 
are hidden by dust storms and haze. (NASA/JPL/Malin 
Space Science Systems) 


Mars is a superior planet, which means that it 
is most easily observable when it is closest to the 
Earth, i.e. during opposition, when the planet is 
above the horizon all night long. 

The rotation axis of Mars is tilted 25° to the 
ecliptic, about the same amount as the Earth’s 
axis. A Martian day is only half an hour longer 
than a terrestrial day. Mars’ orbit is significantly 
elliptical, resulting in temperature variations of 
about 30 °C at the subsolar point between the 
aphelion and perihelion. This has a major influ¬ 
ence on the climate. Huge dust storms are oc¬ 
casionally seen on Mars (Fig. 8.12). Usually the 
storms begin when Mars is at the perihelion. 
Heating of the surface gives rise to large temper¬ 
ature differences that in turn cause strong winds. 
The wind-driven dust absorbs more heat and fi¬ 
nally the whole planet is covered by a dust storm 
where the wind speeds exceed 100 m/s. 

The atmosphere of Mars is mainly composed 
of carbon dioxide (95 %). It contains only 2 % ni¬ 
trogen and 0.1-0.4 % oxygen. The atmosphere 
is very dry: if all the moisture were condensed 
on the surface, the water layer would be thin¬ 
ner than 0.1 mm. Even the minor amount of wa- 






8.4 Mars 


197 



Fig. 8.13 A topographic shade map of Mars made from 
the Mars Global Surveyor data. The most prominent fea¬ 
tures are the large shield volcanoes in the northern hemi- 

ter vapour is sufficient to occasionally form some 
thin clouds or haze. 

The air pressure is only 5-8 mbar. A part of 
the atmosphere has escaped but it is probable that 
Mars never had a thick atmosphere. The primor¬ 
dial atmosphere of Mars was, however, somewhat 
similar to that of the Earth. Almost all of its car¬ 
bon dioxide was used up to form carbonate rocks. 
Because there are no plate tectonics on Mars, the 
carbon dioxide was not recycled back into the at¬ 
mosphere as on the Earth. Therefore, the green¬ 
house effect on Mars is significantly smaller than 
on the Earth. 

Craters were already found in the first pictures. 
The southern hemisphere is especially marked by 
craters, indicating that the original surface is still 
visible there. The largest impacts, Hellas and Ar- 
gyre are about 2000 km in diameter. On the other 
hand, the northern hemisphere has an abundance 
of large lava basins and volcanoes (Fig. 8.15). 
The surface is younger than in the southern hemi¬ 
sphere. The largest volcano, Olympus Mons, pro¬ 
trudes more than 20 km above the surround¬ 
ing terrain. The diameter at the bottom is about 
600 km. 

There are no active volcanoes on Mars. The 
mare-like plains on Mars are of the same age as 


sphere and the Valles Marineris canyon system that is 
more than 3000 km long and up to 8 km deep. (MOLA 
Science Team/NASA) 

the Lunar maria, about 3 x 10 9 years old. Volcan- 
ism in the highland and mare-like plains stopped 
at that time, but the giant shield volcanoes are 
much younger, possibly 1-2 x 10 9 years. The 
youngest lava flows on Olympus Mons are pos¬ 
sibly less than 100 million years old. Mars shows 
no sign of plate tectonics. It has no mountain 
chains, nor any global patterns of volcanism. 

There are also several canyons, the largest of 
which is Valles Marineris (Fig. 8.13). Its length 
is 5000 km, width 200 km, and depth about 
6 km. Compared with Valles Marineris, the Grand 
Canyon is merely a scratch on the surface. 

Ancient riverbeds (Fig. 8.15), too small to be 
seen from the Earth, were also discovered by 
spacecraft. Rivers were probably formed soon af¬ 
ter the formation of Mars itself, when there was 
a great deal of water and the atmospheric pres¬ 
sure and temperature were higher. At present, the 
temperature and air pressure on Mars are too low 
for free water to exist, although there have been 
speculations on warm weather cycles in the more 
recent history of the planet. The mean temper¬ 
ature is now below —50 °C and, on a warm 
summer day, the temperature can rise close to 
zero near the equator. Most of the water is con¬ 
tained in kilometres deep permafrost below the 
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Fig. 8.14 The panorama taken by the Opportunity rover shows structures resembling terrestrial sediments that could 
have been caused by water. (NASA/JPL/Caltech) 



Fig. 8.15 Volcanoes, impact craters and rivers, (a) Mars 
Global Surveyor wide-angle view of Olympus Mons in 
April 1998. (b) Small impact craters and sand dunes with 
a resolution of 1.5 m per pixel. The picture covers a 1.5 km 
wide portion of Isidis Planitia. (c) Three major valley sys¬ 
tems east of the Hellas plains. These valleys have probably 

surface and in the polar caps. The theory was con¬ 
firmed in 2002, when the Mars Odyssey space¬ 
craft detected a large supply of subsurface wa¬ 
ter ice of a wide area near the south pole. The 
ice is mixed into the soil a meter below the sur¬ 
face. Two rovers, Spirit and Opportunity, oper¬ 
ating on the Mars discovered in 2004 minerals 
like hematite and goethite which proved the ex¬ 
istence of liquid water on the surface of Mars. 
The period when the liquid water existed is un¬ 
known. 

The polar caps are composed both of water 
and carbon dioxide ice. The northern cap is al¬ 


been formed by large outbursts of liquid water but the age 
of the valleys is unknown. The valleys are all roughly 1 km 
deep and 10-40 km wide. The picture covers an area ap¬ 
proximately 800 km across. (Mars Global Surveyor, 2000) 
(NASA/JPL/Malin Space Science Systems) 


most season-independent, extending down to lati¬ 
tude 70°. On the other hand, the southern cap, 
which reaches to the latitude —60° in the south¬ 
ern winter, disappears almost totally during the 
summer. The southern cap consists mostly of 
CO 2 ice. The permanent parts are of ordinary wa¬ 
ter ice, since the temperature, —73 °C, is too high 
for CO 2 ice. The water ice layers can be hundreds 
of metres thick. 

The dark areas are not vegetation, but loose 
dust, moved around by strong winds. These winds 
raise the dust high into the atmosphere, colouring 
the Martian sky red. The Mars landers have re- 
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Fig. 8.16 Phobos (left) 
and Deimos, the two 
moons of Mars. They can 
be captured asteroids. 
(NASA) 



vealed a reddish regolithic surface, scattered with 
boulders (Fig. 8.14). The red colour is caused 
mainly by iron oxide, rust; already in the 1950’s, 
the existence of limonite (2 FeC >3 3 H 2 O) was de¬ 
duced from polarisation measurements. The on¬ 
site analysis showed that the soil consists of 
13 % iron and 21 % silicon. The abundance of 
sulphur was found to be ten times that found on 
the Earth. 

The interior of Mars is not well known. 
Mars has probably a dense core approximately 
1500 km in radius, a molten rocky mantle which 
is denser than the Earth’s mantle and a thin crust. 
The crust is 80 km thick in the southern hemi¬ 
sphere but only about 35 km thick in the northern 
one. The low mean density compared with other 
terrestrial planets may indicate that in addition to 
iron the core contains a relatively large fraction 
of sulphur. 

The Mars Global Surveyor confirmed in 1997 
a weak magnetic field. It is probably a remnant of 
an earlier global field that has since disappeared. 
This has important implications for the structure 
of Mars’ interior. There are no electric currents 
creating a magnetic field and therefore the core 
may be (at least partially) solid. 

Three biological experiments of the Viking 
landers in 1976 searched for signs of life. No or¬ 
ganic compounds were found—however, the bi¬ 
ological tests did give some unexpected results. 
A closer look at the results indicated no life, but 
some uncommon chemical reactions. 


Mars has two moons, Phobos and Deimos 
(Fig. 8.16). The size of Phobos is roughly 27 km x 
21 km x 19 km, and the orbital period around 
Mars is only 7 h 39 min. In the Martian sky, Pho¬ 
bos rises in the west and sets in the east. Deimos 
is smaller. Its diameter is 15 km x 12 km x 11 km. 
The orbital period of Deimos is slightly over 30 
hours, or just six hours longer than the rotation 
period of Mars. The synodic period of Deimos is 
nearly 5.5 days, and so it will stay over 2.5 days 
above the Martian horizon. 

There are craters on both moons. Polarimetric 
and photometric results show that they are com¬ 
posed of material resembling carbonaceous chon- 
drite meteorites. 


8.5 Jupiter 

The realm of terrestrial planets ends at the aster¬ 
oid belt. Outside this, the relative abundance of 
volatile elements is higher and the original com¬ 
position of the solar nebula is still preserved in 
the giant planets. The first and largest is Jupiter. 
Its mass is 2.5 times the total mass of all other 
planets, almost 1/1000 of the solar mass. The 
bulk of Jupiter is mainly hydrogen and helium. 
The relative abundance of these elements are ap¬ 
proximately the same as in the Sun, and the den¬ 
sity is of the same order of magnitude, namely 
1330 kg m“ 3 . 

During oppositions, the angular diameter of 
Jupiter is as large as 50". The dark belts and 
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Fig. 8.17 A composed 
image of Jupiter taken by 
the Cassini spacecraft in 
December 2000. The 
resolution is 114 km/pixel. 
The dark dot is the shadow 
of the moon Europa. 
(NASA/JPL/University of 
Arizona) 



lighter zones are visible even with a small tele¬ 
scope. These are cloud formations, parallel to the 
equator (Fig. 8.17). The most famous detail is the 
Great Red Spot, a huge cyclone, rotating coun¬ 
terclockwise once every six days. The spot was 
discovered by Giovanni Cassini in 1655; it has 
survived for centuries, but its true age is unknown 
(Fig. 8.18). 

The rotation of Jupiter is rapid; one revolu¬ 
tion takes 9 h 55 min 29.7 s. This is the period 
determined from the variation of the magnetic 
held, and it reflects the speed of Jupiter’s inte¬ 
riors where the magnetic held is born. As might 
be expected, Jupiter does not behave like a rigid 
body. The rotation period of the clouds is about 
hve minutes longer in the polar region than at the 
equator. Due to its rapid rotation, Jupiter is non- 
spherical; battening is as large as 1/15. 

There is possibly an iron-nickel core in the 
centre of Jupiter. The mass of the core is proba¬ 
bly equal to a few tens of Earth masses. The core 
is surrounded by a layer of metallic liquid hy¬ 
drogen, where the temperature is over 10,000 K 


and the pressure, three million atm. Owing to this 
huge pressure, the hydrogen is dissociated into 
single atoms, a state unknown in ordinary labora¬ 
tory environments. In this exotic state, hydrogen 
has many features typical of metals. This layer 
is electrically conductive, giving rise to a strong 
magnetic held. Closer to the surface where the 
pressure is lower, the hydrogen is present as nor¬ 
mal molecular hydrogen, H 2 . At the top there is 
a 1000 km thick atmosphere. 

The atmospheric state and composition of 
Jupiter has been accurately measured by the 
spacecraft. In situ observations were obtained in 
1995, when the probe of the Galileo spacecraft 
was dropped into Jupiter’s atmosphere. It sur¬ 
vived nearly an hour before crushing under the 
pressure, collecting the hrst direct measurements 
of Jupiter’s atmosphere. 

Belts and zones are stable cloud formations. 
Their width and colour may vary with time, but 
the semi-regular pattern can be seen up to the lat¬ 
itude 50°. The colour of the polar areas is close to 
that of the belts. The belts are reddish or brown- 
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Fig. 8.18 Jupiter’s Great 
Red Spot and its 
surroundings with several 
smaller ovals as seen by 
Voyager 1 in 1979. Cloud 
details of 160 kilometres 
are visible. (NASA) 



ish, and the motion of the gas inside a belt is 
downward. The gas flows upward in the white 
zones. The clouds in the zones are slightly higher 
and have a lower temperature than those in the 
belts. Strong winds or jet streams blow along the 
zones and belts. The speed of the wind reaches 
150 m/s at some places in the upper atmosphere. 
According to the measurements of the Galileo 
probe, the wind speeds in the lower cloud lay¬ 
ers can reach up to 500 m/s. This indicates that 
the winds in deeper atmospheric layers are driven 
by the outflowing flux of the internal heat, not the 
solar heating. 

The colour of the Great Red Spot (GRS) re¬ 
sembles the colour of the belts (Fig. 8.18). Some¬ 
times it is almost colourless, but shows no signs 
of decrepitude. The GRS is 14,000 km wide and 
30,000-40,000 km long. Some smaller red and 
white spots can also be observed on Jupiter, but 
their lifetime is generally much less than a few 
years. 

The ratio of helium to hydrogen in the deep 
atmosphere is about the same as in the Sun. The 


results of the Galileo spacecraft gave consider¬ 
ably higher abundance than previous estimates. 
It means that there are no significant differenti¬ 
ation of helium, i.e. helium is not sinking to the 
interior of the planet as was expected according 
to the earlier results. Other compounds found in 
the atmosphere include methane, ethane and am¬ 
monia. The temperature in the cloud tops is about 
130 K. 

Jupiter radiates twice the amount of heat that 
it receives from the Sun. This heat is a remnant 
of the energy released in the gravitational con¬ 
traction during the formation of the planet. Thus 
Jupiter is still gradually cooling. The internal heat 
is transferred outward by convection; this gives 
rise to flows in the metallic hydrogen, causing the 
strong magnetic field. 

The ring of Jupiter (Fig. 8.19) was discovered 
in 1979. The innermost toroid-shaped halo is be¬ 
tween 92,000-122,500 km from Jupiter’s centre. 
It consists of dust falling from the main ring to¬ 
ward the planet. The main ring extends from the 
halo boundary out to about 128,940 km, just in- 
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Fig. 8.19 Mosaic of Jupiter’s ring system taken by the 
Galileo spacecraft when the spacecraft was in Jupiter’s 
shadow looking back toward the Sun. Jupiter’s ring sys¬ 
tem is composed of three parts: a thin outermost ring, a flat 

side the orbit of the moon Adrastea. The ring 
particles are small, a few microns only, and they 
scatter light forward much more effectively than 
backward. Therefore, they were not discovered 
prior the Voyager flyby. A ring consisting of such 
small particles cannot be stable, and new material 
must enter the ring continuously. Possible sources 
are the small moons Adrastea and Metis. 

The two faint outermost rings are fairly uni¬ 
form in nature. The inner of them extends from 
the orbit of Adrastea out to the orbit of Amalthea 
at 181,000 km. The fainter outermost ring ex¬ 
tends out to Thebe’s orbit at 221,000 km. 

Jupiter’s rings and moons exist within an in¬ 
tense radiation belt of Jupiter’s magnetic field. 
The magnetosphere extends 3-7 million kilome¬ 
tres toward the Sun, depending on the strength 
of the solar wind. In the opposite direction it 
stretches to a distance of at least 750 million kilo¬ 
metres, behind Saturn’s orbit. 

Jupiter is an intense radio source. Its radio 
emission can be divided into three components, 
namely thermal millimetre and centimetre radia¬ 
tion, nonthermal decimetric radiation and burstal- 
decametric radiation. The nonthermal emission is 
most interesting; it is partly synchrotron radia¬ 
tion, generated by relativistic electrons in the Jo¬ 
vian magnetosphere. Its intensity varies in phase 
with Jupiter’s rotation; thus the radio emission 
can be used for determining the exact rotation 


main ring, and an innermost doughnut-shaped halo. These 
rings are made up of dust-sized particles that originate 
from Io, or are blasted off from the nearby inner satellites 
by small impacts. (NASA/University of Arizona) 

rate. The decametric bursts are related to the po¬ 
sition of the innermost large moon, Io, and are 
possibly generated by the electric current of mil¬ 
lions of Amperes observed between Jupiter and 
the plasma torus at the orbit of Io. Also auroras 
are common in Jupiter (Fig. 8.20). 

In the beginning of year 2016 there were 
67 known moons of Jupiter, but probably more 
small satellites will be found in the future. The 
four largest, Io, Europa, Ganymede and Callisto 
are called the Galilean satellites (Fig. 8.21), in 
honour of Galileo Galilei, who discovered them 
in 1610. The Galilean satellites can already be 
seen with ordinary binoculars. They are the size 
of the Moon or even planet Mercury. The other 
moons are small, most of them only a few kilo¬ 
metres in diameter. 

Owing to tidal forces, the orbits of Io, Eu¬ 
ropa and Ganymede have been locked into a res¬ 
onance, so that their longitudes X strictly satisfy 
the equation 

Mo 3/-H U ropa + 2/.(; an y rne dc — 180 . (8.1) 

Hence the moons can never be in the same direc¬ 
tion when seen from Jupiter. 

Io is the innermost Galilean satellite. It is a lit¬ 
tle larger than the Moon. Its surface is spotted by 
numerous calderas, volcanoes without a moun¬ 
tain. The molten material is ejected up to a height 
of 250 km, and a part of the gas gets into Io’s or- 
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Fig. 8.20 Left: NASA Hubble Space Telescope close-up 
view of an aurora on Jupiter. The image shows the main 
oval of the aurora, centred over the magnetic north pole, 
and diffuse emissions inside the polar cap. (NASA, John 
Clarke/University of Michigan) Right: The image taken 

bit. The volcanic activity on Io is much stronger 
than on the Earth. There is a 100 m bulk of 
the permanent tide raised by Jupiter. Due to 
the orbital perturbations caused by Europa and 
Ganymede the orbit of lo is slightly elliptical and 
therefore the orbital speed varies. The tidal bulk 
is forced to move with respect to the surface. This 
generates friction, which is transformed to heat. 
This heat keeps the sulphur compounds molten 
beneath the colourful surface of Io. No traces of 
impact craters are visible. The whole surface is 
new, being renewed continuously by eruptions. 
There is no water on Io. 

Europa is the smallest of the Galilean satel¬ 
lites, a little smaller than the Moon. The surface 
is ice-covered and the geometric albedo is as high 
as 0.6. The surface is smooth with only a few fea¬ 
tures more than a hundred metres high. Most of 
the markings seem to be albedo features with very 
low relief. Only a few impact craters have been 
found indicating that the surface is young. The 
surface is renewed by fresh water, trickling from 
the internal ocean. Galileo spacecraft has found 
a very weak magnetic field. The field varies pe¬ 
riodically as it passes through Jupiter’s magnetic 
field. This shows that there is a conducting mate¬ 
rial beneath Europa’s surface, most likely a salty 
ocean that could even be 100 km deep. At the cen¬ 
tre, there is a solid silicate core. 


on January 2001 by NASA’s Cassini spacecraft shows the 
bubble of charged particles trapped in the magnetosphere. 
The magnetic field and the torus of the ionised mate¬ 
rial from the volcanoes of Io are drawn over the image. 
(NASA/JPL/Johns Hopkins University) 

Ganymede is the largest moon in the solar sys¬ 
tem. Its diameter is 5300 km; it is larger than 
the planet Mercury. The density of craters on the 
surface varies, indicating that there are areas of 
different ages. Ganymede’s surface is partly very 
old, highly cratered dark regions, and somewhat 
younger but still ancient lighter regions marked 
with an extensive array of grooves and ridges. 
They have a tectonic origin, but the details of the 
formations are unknown. About 50 % of the mass 
of the moon is water or ice, the other half being 
silicates (rocks). Contrary to Callisto, Ganymede 
is differentiated: a small iron or iron/sulphur core 
surrounded by a rocky silicate mantle with an 
icy (or liquid water) shell on top. Ganymede has 
a weak magnetic field. 

Callisto is the outermost of the large moons. 
It is dark; its geometric albedo is less than 0.2. 
Callisto seems to be undifferentiated, with only 
a slight increase of rock toward the centre. About 
40 % of Callisto is ice and 60 % rock/iron. The 
ancient surface is peppered by meteorite craters; 
no signs of tectonic activity are visible. How¬ 
ever, there have been some later processes, be¬ 
cause small craters have mostly been obliterated 
and ancient craters have collapsed. 

The currently known moons can be divided 
into four different groups: small moons inside the 
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Fig. 8.21 (Top) The Galilean satellites of Jupiter. From Europa. (Right page bottom) Surface details of Ganymede 

left to right: Io, Europa, Ganymede, and Callisto and Callisto. (NASA/Brown University, NASA/JPL) 

(NASA/DLR). (Right page top) Surface details of Io and 
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Fig. 8.22 Saturn and its rings. Three satellites (Tethys, 
Dione, and Rhea) are seen to the left of Saturn, and the 
shadows of Mimas and Tethys are visible on Saturn’s 
cloud tops. (NASA/JPL) 

orbits of the Galilean satellites, the Galilean satel¬ 
lites, and two groups of irregular moons outside 
the orbit of the Galilean satellites. The moons of 
the inner distant group move on direct orbits, and 
the moons of the most distant group on retrograde 
orbits. It is possible that many of these are small 
asteroids captured by Jupiter. 


8.6 Saturn 

Saturn (Fig. 8.22) is the second largest planet. Its 
diameter is about 120,000 km, ten times the diam¬ 
eter of the Earth, and the mass, 95 Earth masses. 
The density is only 700 kgm -3 , less than the 
density of water. The rotation axis is tilted about 
21° with respect to the orbital plane, so every 15 
years, the northern or the southern pole is well 
observable. 

The rotation period is 10 h 39.4 min, deter¬ 
mined from the periodic variation of the magnetic 



Fig. 8.23 A schematic drawing of the structure of the 
Saturnian rings. The brightest A and B rings can be seen 
even with a small telescope; the other ones are very faint 

field by the Voyager spacecraft in 1981. However, 
Cassini spacecraft observed in 2004 the period 
of 10 h 45 min. The reason for the change is un¬ 
known. Due to the rapid rotation, Saturn is flat¬ 
tened; the flattening is 1/10, which can be easily 
seen even with a small telescope. 

The internal structure of Saturn resembles that 
of Jupiter. Due to its smaller size, the metallic hy¬ 
drogen layer is not so thick as on Jupiter. The 
thermal radiation of Saturn is 2.8 times that of 
the incoming solar flux. The heat excess origi¬ 
nates from the differentiation of helium. The he¬ 
lium atoms are gradually sinking inward and the 
released potential energy is radiated out as a ther¬ 
mal radiation. The abundance of helium in Sat¬ 
urn’s atmosphere is only about half of that on 
Jupiter. 

The winds, or jet streams, are similar to those 
of Jupiter but Saturn’s appearance is less colour¬ 
ful. Viewed from the Earth, Saturn is a yellowish 
disk without any conspicuous details. The clouds 
have fewer features than those on Jupiter, be¬ 
cause a haze, composed of hydrogen, ammonium 
and methane floats above the cloud tops. Further¬ 
more, Saturn is farther from the Sun than Jupiter 
and thus has a different energy budget. 

The temperature at the cloud tops is about 
94 K. Close to the equator the wind speeds ex¬ 
ceed 400 m/s and the zone in which the direc¬ 
tion of the wind remains the same extends 40° 
from the equator. Such high speeds cannot be ex¬ 
plained with external solar heat, but the reason for 
the winds is the internal flux of heat. 

Saturn’s most remarkable feature is a thin ring 
system (Figs. 8.23, 8.24), lying in the planet’s 
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equatorial plane. The Saturnian rings can be seen 
even with a small telescope. The rings were dis¬ 
covered by Galileo Galilei in 1610; only 45 years 
later did Christian Huygens establish that the for¬ 
mation observed was actually a ring, and not 
two oddly behaving bulbs, as they appeared to 
Galileo. In 1857 James Clerk Maxwell showed 
theoretically that the rings cannot be solid but 
must be composed of small particles. 

The rings are made of normal water ice. The 
size of the ring particles ranges from microns to 
truck-size chunks. Most of the particles are in 
range of centimetres to metres. The width of the 
ring system is more than 60,000 km (about the ra¬ 
dius of Saturn) and the thickness, at most 100 m, 
and possibly only a few metres. Cassini space¬ 
craft discovered also molecular oxygen around 
the rings, probably as a product of the disintegra¬ 
tion of water ice from the rings. 

According to Earth-based observations, the 
rings are divided into three parts, called simply 
A, B, and C. The innermost C ring is 17,000 km 
wide and consists of very thin material. There is 
some material even inside this (referred to as the 
D ring), and a haze of particles may extend down 
to the clouds of Saturn. 

The B ring is the brightest ring. Its total width 
is 26,000 km, but the ring is divided into thou¬ 
sands of narrow ringlets, seen only by the space¬ 
craft. From the Earth, the ring seems more or less 



Fig. 8.24 At a close distance, the rings can be seen to be 
divided into thousands of narrow ringlets. (JPL/NASA) 


uniform. Between B and A, there is a 3000 km 
wide gap, the Cassini division. It is not totally 
void, as was previously believed; some material 
and even narrow ringlets have been found in the 
division by the Voyager space probes. 

The A ring is not divided into narrow ringlets 
as clearly as the B ring. There is one narrow but 
obvious gap, Encke’s division, close to the outer 
edge of the ring. The outer edge is very sharp, due 
to the “shepherd” moon, some 800 km outside the 
ring. The moon prevents the ring particles from 
spreading out to larger orbits. It is possible that 
the appearance of B is due to yet-undiscovered 
moonlets inside the ring. 

The F ring, discovered in 1979, is about 
3000 km outside A. The ring is only a few hun¬ 
dred kilometres wide. On both sides there is 
a small moon; these shepherds prevent the ring 
from spreading. An interior moon passing a ring 
particle causes the particle to move to a larger or¬ 
bit. Similarly, at the outer edge of the ring, a sec¬ 
ond moon forces the particles inward. The net 
result is that the ring is kept narrow. 

Outside the F ring, there are some zones of 
very sparse material, sometimes referred to as the 
G and E rings. These are merely collections of 
small particles. 

The Saturnian rings were possibly formed to¬ 
gether with Saturn and are not debris from some 
cosmic catastrophe, like remnants of a broken 
moon. The total mass of the rings is 10 -7 of the 
mass of Saturn. If all ring particles were collected 
together, they would form an ice ball, 600 km in 
diameter. 

In 2009 a very sparse ring was observed at 
the distance of the moon Phoebe. It has the same 
inclination as Phoebe, nearly 30°. It has proba¬ 
bly originated in material shattered from Phoebe 
by micrometeorites. The ring extends inward to 
the orbit of Iapetus. The hemispheres of Iapetus 
are quite different, one side is very dark. Since 
Iapetus always turns the same side towards Sat¬ 
urn, only one hemisphere is hit by ring particles. 
Eventually ice evaporates away leaving only the 
dark cinder. 

A total of 62 moons (in 2016) of Saturn 
are known. Many of the large Saturnian moons 
(Fig. 8.25) were observed by Pioneer 11 and Voy¬ 
ager 1 and 2 spacecrafts. Some of the large moons 
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Fig. 8.25 Saturnian moons photographed by the Cassini 
spacecraft in 2005-2006 (a) Hyperion, (b) Enceladus, 
(c) Iapetus and (d) Tethys. (e) A radar picture of the north¬ 
like Enceladus and Tethys are composed mainly 
of ice. The temperature of the primeval nebula at 
the distance of Saturn was so low that bodies of 
pure ice could form and survive. 

Some moons are dynamically interesting; 
some have an exotic geological past. Outside the 
F ring, there are two moonlets, Epimetheus and 
Janus , almost in the same orbit; the difference of 
the semimajor axes is about 50 km, less than the 
radii of the moons. The inner moon is gaining on 
the outer one. The moons will not collide, since 


em latitudes of Titan, taken by Cassini in summer 2006. 
The black patches are probably methane lakes. The width 
of the picture is about 450 km. (Photo NASA) 

the speed of the trailing moon increases and the 
moon moves outward. The speed of the leading 
moon decreases and it drops inward. The moons 
exchange their roles roughly every four years. 
There are also several shepherding satellites, like 
Atlas , Prometheus and Pandora that keep rings in 
their place. Their gravitational pull prevents ring 
particles from drifting away. 

The innermost of the “old” moons is Mimas. 
There is a huge crater, named Herschel, on Mi¬ 
mas’ surface with a diameter of 100 km and 
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Fig. 8.25 (Continued) 


a depth of 9 km (Fig. 8.25). Bigger craters exist 
in the solar system, but relative to the size of the 
parent body, this is almost the biggest crater there 
could be room for (otherwise the crater would be 
bigger than Mimas itself). On the opposite side, 
some grooves can be seen, possibly signifying 
that impact has almost torn the moon apart. 

The surface of the next moon, Enceladus, con¬ 
sists of almost pure ice, and one side is nearly 
craterless. Craters and grooves can be found on 
the other hemisphere. Tidal forces result in vol¬ 
canic activity where water (not lava or other "hot” 
material) is discharged to the surface. 

Titan is the largest of the Saturnian moons. 
Its diameter is 5100 km, so it is only slightly 
smaller than Jupiter’s moon Ganymede. Titan is 
the only moon with a dense atmosphere. The at¬ 
mosphere is mainly nitrogen (99 %) and methane, 
and the pressure at the surface is 1.6 bar. The 
temperature is about 90 K. Reddish clouds form 
the visible surface some 200 km above the solid 
body. The Huygens lander made measurements 
and sent images of Titan in 2005. It detected lakes 
of methane as well as geysers ejecting methane, 
ammonia and water. 


8.7 Uranus 

The planets from Mercury to Saturn were already 
known in antiquity. The next planet, Uranus, can¬ 
not be seen with the naked eye. 


The famous German-English amateur astrono¬ 
mer William Herschel discovered Uranus in 1781. 
Herschel himself first thought that the new ob¬ 
ject was a comet. However, the extremely slow 
motion revealed that the body was far beyond 
the orbit of Saturn. Based on the first observa¬ 
tions, the Finnish astronomer Anders Lexell cal¬ 
culated a circular orbit. He was one of the first 
to propose that the newly discovered object was 
a planet. Johann Bode of the Berlin Observatory 
suggested the name Uranus but more than five 
decades passed before the name was unanimously 
accepted. 

The mean distance of Uranus is 19 au, and the 
orbital period 84 years. The inclination of the ro¬ 
tation axis is 98°, which is totally different from 
the other planets. Due to this uncommon geom¬ 
etry, the poles are either lit or in darkness for 
decades. The rotation period, confirmed by the 
Voyager 2 magnetometric measurements in 1986, 
is 17.3 hours; the exact period had been uncertain 
prior to the fly-by. 

Uranus is greenish, as viewed through a tele¬ 
scope. Its colour is due to the strong methane ab¬ 
sorption bands in the near-infrared. A part of the 
red light is also absorbed, leaving the green and 
blue part of the spectrum untouched. Uranus is 
almost featureless (Fig. 8.26) because its clouds 
are below a thick haze or smog. The origin of the 
smog may be methane that solar radiation broke 
into radicals which then formed e.g. asethylene 
and ethane. 
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Fig. 8.26 Two views of Uranus. The left picture shows 
Uranus as it would appear to the naked eye. (NASA). 
At the right there is a Hubble Space Telescope view of 


Uranus surrounded by its rings. Also 10 satellites are vis¬ 
ible in the original picture. (Seidelmann, U.S. Naval Ob¬ 
servatory, and NASA) 



Fig. 8.27 Left: The rings of Uranus are very narrow and scattered forward when the Voyager spacecraft was in the 

composed of a dark material. Nine rings are visible in the shadow of the planet. (NASA) 

picture of Voyager in 1986. Right: Rings seen in the light 


The strong limb darkening caused by the smog 
makes the terrestrial determination of the Uranus’ 
size difficult. Therefore, the radius was not accu¬ 
rately determined until 1977 during a stellar oc- 
cultation caused by Uranus. The rings of Uranus 
were discovered at the same time. 

The internal structure of Uranus is thought 
to be slightly different from that of other giant 


planets. Above the innermost rocky core, there is 
a layer of water, which, in turn, is surrounded by 
a mantle of hydrogen and helium. The mixture of 
water and ammonia and methane therein are dis¬ 
sociated to ions under the heavy pressure. This 
mixture behaves more like a molten salt than wa¬ 
ter. The convection flows in this electrically con¬ 
ductive “sea” give rise to the Uranian magnetic 
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Fig. 8.28 Four Uranian moons (from top left to lower right)'. Miranda, Ariel, Titania and Umbriel. (NASA) 


field. The strength of the magnetic field at the 
cloud tops is comparable to the terrestrial field. 
However, Uranus is much larger then the Earth, 
so the true strength of the field is 50 times greater 
than that of the Earth. The Uranian magnetic field 
is tilted 59° with respect to the rotation axis. No 
other planet has such a highly inclined magnetic 
field. 

The Uranian rings (Fig. 8.27) were discovered 
in 1977, during a stellar occultation. Secondary 
occultations were observed before and after the 
main event. A total of 13 rings are known, nine of 


which were discovered in the occultation. The in¬ 
nermost ring is broad and diffuse. All other rings 
are dark and very narrow, only a few hundred me¬ 
tres or a few kilometres wide. The Voyager 2 re¬ 
sults showed that the rings contain very little dust, 
unlike those of Jupiter and Saturn. The mean size 
of the ring particles is more than 1 metre. The ring 
particles are darker than practically any known 
material in the solar system; the cause of this dark 
colour is unknown. 

There are 27 moons (2016 number) orbiting 
around Uranus, ten of which were discovered by 
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Voyager 2. The geological history of some moons 
is puzzling, and many features reminiscent of an 
active past can be found. 

The innermost of the large moons, Miranda, 
is one of the most peculiar objects discovered 
(Fig. 8.28). It has several geological formations 
also found elsewhere (but here they are all mixed 
together), in addition to the quite unique V- 
shaped formations. It is possible that Miranda’s 
present appearance is the result of a vast colli¬ 
sion that broke the moon apart; some pieces may 
have later settled down, inside out. Another pe¬ 
culiar object is Umbriel. It belongs to the ever 
increasing family of unusual dark bodies (such as 
the Uranian rings, one side of Iapetus and Hal¬ 
ley’s comet). The dark surface of Umbriel is cov¬ 
ered by craters without any traces of geological 
activity. 


8.8 Neptune 

The orbit of Uranus was already well known in 
the beginning of the 19th century. However, some 
unknown perturbations displaced Uranus from 
its predicted orbit. Based on these perturbations, 
John Couch Adams, of Cambridge, and Urbain 


Jean-Joseph Le Verrier, of Paris, independently 
predicted the position of the unknown perturbing 
planet. 

The new planet was discovered in 1846 by Jo¬ 
hann Gottfried Galle at the Berlin Observatory; 
Le Verrier’s prediction was found to be only 1° 
off. The discovery gave rise to a heated contro¬ 
versy as to who should be given the honour of 
the discovery, since Adams’ calculations were 
not published outside the Cambridge Observa¬ 
tory. When the quarrel was settled years later, 
both men were equally honoured. The discovery 
of Neptune was also a great triumph of the New¬ 
tonian theory of gravitation. 

The semimajor axis of the orbit of Neptune 
is 30 au and the orbital period around the Sun 
165 years. The internal rotation period, confirmed 
by Voyager 2 in 1989, is 16 hours 7 minutes 
and the rotation period of the outer layers of the 
clouds is about 17 hours. The obliquity of the ro¬ 
tation axis is 29° but the magnetic field is tilted 
some 47° with respect to the rotation axis. The 
magnetic held is tilted like in Uranus, but the held 
strength is much smaller. 

The density of Neptune is 1638 kg m , and 
the diameter 49,500 km. Thus the density of Nep- 



Fig. 8.29 (Left) Neptune shows more features than 
Uranus. In the picture of Voyager 2 the Great Dark Spot, 
accompanied by bright, white clouds is well visible. Their 
appearance is changing rapidly. To the south of the Great 
Dark Spot is a bright feature and still farther south is an¬ 
other dark spot. Each feature moves eastward at a different 


velocity. (Right) Details of the Southern Dark Spot. The 
V-shaped structure near the right edge of the bright area 
indicates that the spot rotates clockwise. Unlike the Great 
Red Spot on Jupiter, which rotates counterclockwise, the 
material in the Neptune’s dark oval will be descending. 
(NASA/JPL) 
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Fig. 8.30 The rings of Neptune. Ring particles are small 
and best visible in the forward scattered light. There are 
several brightenings in the outermost ring. One of the rings 
appears to have a twisted structure. Neptune at left is over¬ 
exposed. (NASA/JPL) 

tune is higher than that of other giant planets. The 
internal structure is quite simple: The core, com¬ 
posed of silicates (rocks) is about 16,000 km in 
diameter. This is surrounded by a layer of wa¬ 
ter and liquid methane and the outermost gaseous 
layer, atmosphere, is mainly composed of hydro¬ 
gen and helium, methane and ethane being a mi¬ 
nor components. 

Cloud structures are more complicated than on 
Uranus, and some dark spots, like in Jupiter, were 
visible during the Voyager fly-by (Fig. 8.29). The 
speed of the winds are high, up to 400 m/s. 

Like other giants, Neptune also has rings 
(Fig. 8.30). The rings were discovered by Voy¬ 
ager 2, although their existence was already ex¬ 
pected prior the fly-by. Two relatively bright but 
very narrow rings are at a distance of 53,000 and 
62,000 km from the centre of the planet. More¬ 
over, there are some faint areas of fine dust. 

There are 14 known moons, six of which were 
discovered by Voyager 2. Before the Voyager fly- 



Fig. 8.31 The southern hemisphere of Triton, Neptune’s 
largest satellite in a picture taken in 1989 by Voyager 2. 
The dark spots may indicate eruptions of “icy volcanoes”. 
Voyager 2 images showed active geyser-like eruptions 
spewing nitrogen gas and dark dust particles several kilo¬ 
metres into the atmosphere, (NASA) 

by, only two of them, Triton and Nereid, were 
known. 

The largest of the moons, Triton, is 2700 km 
in diameter, and it has a thin atmosphere, mainly 
composed of nitrogen. The albedo is high: Triton 
reflects 60-80 % of the incident light. The sur¬ 
face is relatively young, without any considerable 
impact craters (Fig. 8.31). There are some ac¬ 
tive “geysers” of liquid nitrogen, which partly ex¬ 
plains the high albedo and the lack of the craters. 
The low surface temperature of Triton, 37 K, 
means that the nitrogen is solid and covers the 
surface like snow. It is the lowest surface temper¬ 
ature known in the solar system. 


8.9 Dwarf Planets 

Besides actual planets there are some planet like 
objects that orbit the Sun and are nearly spheri¬ 
cal but have not managed to clear the neighbour¬ 
hood of their orbits. The upper and lower lim¬ 
its to the size and mass of dwarf planets are not 
strictly specified. The lower limit, however, is de¬ 
termined by the hydrostatic equilibrium shape, 
but the size at which this happens may vary ac¬ 
cording to the composition and history of the ob¬ 
ject. It is estimated that up to 40-50 dwarf planets 
will be discovered in coming years. 
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Fig. 8.32 A small portion of the pair of pictures where Pluto was discovered in 1930. The planet is marked with an 
arrow. (Lowell Observatory) 


Currently, there are five dwarf planets in 
the Solar System, namely Ceres, Pluto , Eris, 
Haumea, and Makemake. Ceres was formerly 
counted as an asteroid, Pluto was a planet and 
Eris (2003 UB 313, known also by the nickname 
Xena) was the first Trans-Neptunian object which 
turned out to be larger than Pluto. 

Pluto Pluto was discovered in 1930 at the Low¬ 
ell Observatory, Arizona, after an extensive pho¬ 
tographic search (Fig. 8.32). This search had al¬ 
ready been initiated in the beginning of the cen¬ 
tury by Percival Lowell, on the basis of the pertur¬ 
bations observed in the orbits of Uranus and Nep¬ 
tune. Finally, Clyde Tombaugh discovered Pluto 
less than 6° off the predicted position. However, 
Pluto turned out to be far too small to cause any 
perturbations on Uranus or Neptune. Thus the 
discovery was purely accidental, and the pertur¬ 
bations observed were not real, but caused by mi¬ 
nor errors of old observations. 

The orbit of Pluto is different from planetary 
orbits. The eccentricity is 0.25 and the inclination 
is 17°. During its 250 year orbit, Pluto is closer 
to the Sun than Neptune for 20 years; one such 
period lasted from 1979 to 1999. There is no dan¬ 
ger of Pluto and Neptune colliding, since Pluto 
is high above the ecliptic when at the distance of 
Neptune. Pluto’s orbital period is in a 3:2 reso¬ 
nance with Neptune. 

Pluto has no visible disk as seen with terres¬ 
trial telescopes; instead, it resembles a point, like 


a star. This fact gave an upper limit for the di¬ 
ameter of Pluto, which turned out to be about 
3000 km. The exact mass was unknown until 
the discovery of the Plutonian moon, Charon, in 
1978. The mass of Pluto is only 0.2 % of the 
mass of the Earth. The orbital period of Charon 
is 6.39 days, and this is also the period of rota¬ 
tion of both bodies. Pluto and Charon rotate syn¬ 
chronously, each turning the same side towards 
the other body. The rotation axis of Pluto is close 
to the orbital plane: the tilt is 122°. 

Mutual occultations of Pluto and Charon in 
1985-1987 gave accurate diameters of each body: 
The diameter of Pluto is 2300 km and that of 
Charon, 1200 km. The density of Pluto turned 
out to be about 2100 kgm -3 . Thus Pluto is not 
a huge iceball but about 2/3 of its mass is com¬ 
posed of rocks. The relatively small abundance of 
ices is possibly due to the low temperature dur¬ 
ing the planetary accretion when most of the free 
oxygen was combined with carbon forming car¬ 
bon monoxide. The computed lower limit for wa¬ 
ter ice is about 30 % which is fairly close to the 
value observed in Pluto. 

Pluto has a thin methane atmosphere and there 
is possibly a thin haze over the surface. The sur¬ 
face pressure is 10 -5 -10 -6 atm. It has been spec¬ 
ulated that when Pluto is far from perihelion, the 
whole atmosphere will become frozen and fall on 
the surface. 

Pluto has five satellites. Two of them were dis¬ 
covered by the Hubble Space Telescope in 2005 
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Fig. 8.33 The Hubble Space Telescope photographed 
Pluto and its three known moons in February 2006. The 
smaller moons were found in 2005, and were later named 
Nix and Hydra. Their diameter is estimated as 40-160 
km. (M. Mutchler (STScI), A. Stern (SwRI), and the HST 
Pluto Companion Search Team, ESA, NASA) 



Fig. 8.34 The only detailed images of Pluto were sent by 
the New Horizons probe in 2015. The light heart shaped 
area was named the Tombaugh region to honour the finder 
of Pluto. The dark area to the left of is called the Cthulhu 
region; Cthulhu is a mythical demon in H.P. Lovecraft’s 
horror stories (NASA) 


ing the limit to be in the hydrostatic equilibrium. 
Thus it is no more an asteroid but a dwarf planet. 

The average opposition magnitude of Ceres is 
7.9. Thus it can be seen with binoculars but not 
with the naked eye. 

Nearby images taken by a space probe show a 
few highly reflecting spots. 

Eris In 2003 a Trans-Neptunian object was 
found, initially labelled as 2003 UB313. One of 
the initial nicknames was Xena according to the 
heiress of a tv series. When a moon orbiting Xena 
was found, it was obviously names as Gabrielle. 
However, these names never became official. The 
bigger object was given the name Eris and its 
companion Dysnomia. 

The size of Eris is similar to that of Pluto. The 
size is, however, an estimate based on infrared ra¬ 
diation. Yet it was big enough to be classified as 
a planet if Pluto had retained its planetary status. 
Now Eris is a dwarf planet. The semimajor axis 
of the orbit is 97 au, orbital period 560 years and 
inclination 45°. 


8.10 Minor Bodies 

In addition to planets, moons orbiting the planets, 
dwarf planets and other biggish objects the solar 
system contains a lot of smaller objects that can 
be collectively called minor bodies. 

The class of minor bodies includes aster¬ 
oids, comets, meteoroids, and interplanetary dust. 
However, there is no clear distinction between the 
groups. Some asteroids are cometlike, some as¬ 
teroids coming close to the Earth may have earlier 
been comets that have lost all of their volatile ma¬ 
terial. The classification is based more on the vi¬ 
sual appearance and tradition than physical prop¬ 
erties. 


(Fig. 8.33). They orbit Pluto counterclockwise 
twice the distance of Charon. The New Horizons 
probe found two more moons. 

Ceres Ceres was the first asteroid discovered in 
1801 by Giuseppe Piazzi- It is also the largest of 
the main belt asteroids moving inside Jupiter’s or¬ 
bit. Its diameter is about 1000 km, thus exceed¬ 


8.11 Asteroids 

Asteroids form a large and scattered group of 
Sun-orbiting bodies. The oldest and best-known 
group form the main asteroid belt between Mars 
and Jupiter, with distances of 2.2-3.3 au from the 
Sun (Fig. 8.38). The most distant asteroids are far 
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Fig. 8.35 Top: Ceres was the first detected asteroid. 
Spacecraft images shows its spherical shape, and so it be¬ 
came a dwarf planet. Middle: Vesta is clearly nonspheri- 
cal and has remained as a minor body. It is the brightest 
asteroid: at opposition under ideal conditions it could be 
seen even with the naked eye. Bottom: The asteroid (243) 
Ida and a small moon Dactyl orbiting the asteroid. The ir¬ 
regular shape is typical for most asteroids. Ida is the first 
asteroid known to have a satellite 


beyond the orbit of Pluto, called Trans-Neptunian 
objects. There are also a number of asteroids that 
come closer to the Sun than the Earth. 

Earlier asteroids were called minor planets or 
even planetoids. These names were actually more 
natural than asteroid that only means that the ob¬ 
ject looks a starlike dot, although it has nothing 
to do with stars. 

The first asteroid was found in 1801 (Ceres, 
now a dwarf planet), and at the beginning of 
2016 there were more than 750,000 catalogued 
and 15,000 named asteroids. There were also a 
huge number of asteroids with an approximate or¬ 
bit. The number of catalogued asteroids increases 
currently by thousands every month. It has been 
estimated that more than about half a million as¬ 
teroids larger than 1 km exist in the Solar Sys¬ 
tem. Diameters of asteroids vary from hundreds 
of meters to hundreds of kilometers. The largest 
asteroid Ceres is classified as a dwarf planet and 
the border between smallest asteroids and mete¬ 
oroids is not specified. 

An asteroid observer needs a telescope, since 
even the brightest asteroids are too faint to be 
seen with the naked eye. Asteroids are points of 
light like a star, even if seen through a large tele¬ 
scope; only their slow motion against the stellar 
background reveals that they are members of the 
solar system. 

The rotation of an asteroid gives rise to a reg¬ 
ular light variation. The amplitude of light varia¬ 
tion is in most cases well below 1 magnitude and 
typical rotation periods range from 4 to 15 hours. 

The characteristics of the main belt asteroids 
are best known. Total mass of the main belt as¬ 
teroids is less than 1/1000 of the mass of the 
Earth. The centre of the asteroid belt is at a dis¬ 
tance of approximately 2.8 au, as predicted by the 
Titius-Bode law (Sect. 7.11). According to a for¬ 
merly popular theory, asteroids were thought to 
be debris from the explosion of a planet. This the¬ 
ory, like catastrophe theories in general, has been 
abandoned. 

The currently accepted theory assumes that as¬ 
teroids were formed simultaneously with the ma¬ 
jor planets. The primeval asteroids were large 
chunks, most of them orbiting between the orbits 
of Mars and Jupiter. Due to mutual collisions and 
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Fig. 8.36 Left: Asteroid (951) Gaspra was photographed 
by the Galileo spacecraft in October 1991. The illumi¬ 
nated part of the asteroid is about 16 x 12 km. The smallest 
craters in this view are about 300 m across. Right: A mo- 



Fig. 8.37 Quasimoons or pseudomoons are asteroid 
moving on rather peculiar orbits. They are not real moons, 
since they orbit the Sun, but the orbital period is very close 
to that of some planet. Thus the asteroid seems to oscillate 
around the planet. A quasimoon moves on an eccentric or¬ 
bit inside and outside the orbit of the planet. Eventually the 
asteroid will drift away from the planet but will return after 
a long time. Venus and Neptune each have one quasimoon 
and the Earth has several. The orbits of the quasimoons 
have been studied e.g. Seppo Mikkola in Finland and the 
Canadians Kimmo Innanen and Paul Wiegert 


saic of asteroid (433) Eros was taken by the NEAR space¬ 
craft from a distance of 200 km. The crater on top is about 
5 km in diameter. The NEAR spacecraft orbited Eros for 
one year and finally landed on it in 2001. (JPL/NASA) 

fragmentation, the present asteroids are debris of 
those primordial bodies which were never able to 
form a large planet. Some of the biggest aster¬ 
oids may be those original bodies. The orbital el¬ 
ements of some asteroids are very similar. These 
are called the Hirayamafamilies. They are proba¬ 
bly remnants of a single, large body that was bro¬ 
ken into a group of smaller asteroids. There are 
tens of identified Hirayama families, the largest 
ones including Hungarias, Floras, Eos, Themis, 
and Hildas (named after the main asteroid in the 
group). 

The distribution of asteroids inside the aster¬ 
oid belt is uneven (Fig. 8.38); they seem to avoid 
some areas known as the Kirkwood gaps. The 
most prominent void areas are at distances where 
the orbital period of an asteroid around the Sun 
(given by Kepler’s third law) is in the ratio 1:3, 
2:5, 3:7, or 1:2 to the orbital period of Jupiter. 
The motion of an asteroid orbiting in such a gap 
would be in resonance with Jupiter, and even 
small perturbations would tend to grow with time. 
The body would eventually be moved to another 
orbit. However, the resonance effects are not so 
simple: sometimes an orbit is “locked” to a reso- 
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Fig. 8.38 (a) Most of the 
asteroids orbit the Sun in 
the asteroid belt between 
Mars and Jupiter. The 
figure shows the positions 
of about 96,000 catalogued 
asteroids on January 1, 
2000, and the orbits and 
the positions of some 
major planets. The orbital 
elements of the asteroids 
are from the Lowell 
Observatory data base. 

(b) The total number of 
asteroids as a function of 
the distance from the Sun. 
Each bin corresponds to 
0.1 AU. The empty areas, 
the Kirkwood gaps, are at 
those points, where the 
orbital period of an asteroid 
is in a simple ratio to the 
orbital period of Jupiter 



nance, e.g. the Trojans move along the same orbit 
as Jupiter (1:1 resonance), and the Hilda group is 
in the 2:3 resonance. 

Many groups of asteroids orbit the Sun outside 
the main belt. These include the above-mentioned 
Trojans, which orbit 60° behind and ahead of 
Jupiter. The Trojans, which are close to the spe¬ 
cial points La, and L5 of the solution of the re¬ 
stricted three-body problem. At these Lagrangian 
points, a massless body can remain stationary 
with respect to the massive primaries (in this case, 
Jupiter and the Sun). In fact, the asteroids are 
oscillating around the stationary points, but the 
mean orbits can be shown to be stable against per¬ 
turbations. 

Another large family is the Apollo-Amor as¬ 
teroids. The perihelia of Apollo and Amor are in¬ 


side the Earth’s orbit and between the orbits of the 
Earth and Mars, respectively. These asteroids are 
all small, less than 30 km in diameter. The most 
famous is 433 Eros (Fig. 8.36), which was used in 
the early 20th century for determining the length 
of the astronomical unit. When closest to the 
Earth, Eros is at a distance of only 20 million km 
and the distance can be directly measured using 
the trigonometric parallax. Some of the Apollo- 
Amor asteroids could be remnants of short-period 
comets that have lost all their volatile elements. 

There is a marginal probability that some 
Earth-crossing asteroids will collide with the 
Earth. It has been estimated that, on the average, 
a collision of a large asteroid causing a global 
catastrophe may take place once in one million 
years. Collisions of smaller bodies, causing dam- 
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Fig. 8.39 The Kuiper Belt 
is a disk-shaped cloud of 
distant icy bodies inside 
the halo of the Oort cloud. 
The short-period comets 
originate in the Kuiper belt, 
whereas a huge amount of 
icy bodies that form 
a source of long period 
comets resides in the Oort 
cloud (see Sect. 8.12). 
(JPL/NASA) 



age similar to a nuclear bomb, may happen once 
per century. It has been estimated that there are 
500-1000 near-Earth asteroids larger than one 
kilometre in diameter but possibly tens of thou¬ 
sands smaller objects. Programs have been started 
to detect and catalogue all near-Earth asteroids 
and to predict the probabilities of hazardous col¬ 
lisions. 

Distant asteroids form the third large group 
outside the main asteroid belt. The first asteroid 
belonging to this group (2060) Chiron, was dis¬ 
covered in 1977. Chiron’s aphelion is close to the 
orbit of Uranus and the perihelion is slightly in¬ 
side the orbit of Saturn. Distant asteroids are very 
faint and thus difficult to find. 

Already in the 1950’s Gerard Kuiper sug¬ 
gested that comet-like debris from the formation 
of the solar system can exist beyond the orbit of 
Neptune as an additional source of comets to the 
more distant Oort cloud. Later, computer simula¬ 
tions of the solar system’s formation showed that 
a disk of debris should form at the outer edge of 
the solar system. The disk is now known as the 
Kuiper belt (Fig. 8.39). 

The first Trans-Neptunian asteroid (1992 QB1) 
was discovered in 1992, and in the beginning of 


year 2006 there were about 1000 known mem¬ 
bers. The total number of Kuiper belt objects 
larger than 100 km in diameter is estimated to 
be over 70,000. Some of them may be even larger 
than Pluto. The Kuiper belt objects are remnants 
from the early accretion phases of the solar sys¬ 
tem. Several of the Trans-Neptunian objects are 
in or near a 3:2 orbital period resonance with 
Neptune, the same resonance as Pluto. Therefore 
they are called plutinos. 

The exact sizes of asteroids were long un¬ 
known. Edward E. Barnard of the Lick Ob¬ 
servatory determined visually the diameters of 
(1) Ceres, (2) Vesta, (3) Juno, and (4) Pallas in 
the 1890’s (Fig. 8.39). Practically no other re¬ 
liable results existed prior to the 1960’s, when 
indirect methods applying photometry and spec¬ 
troscopy were adopted. Moreover, several stellar 
occultations caused by asteroids have been ob¬ 
served since 1980’s. 

The first images of asteroids were obtained in 
the early 1990’s. In 1991 the Galileo spacecraft 
passed asteroid (951) Gaspra, and in 1993 as¬ 
teroid (243) Ida, on its long way to Jupiter (see 
Sect. 8.15). Finally, in 2001, the NEAR space- 
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Fig. 8.40 Sizes of some asteroids compared with the 
Moon. (Moon image, NASA) 

craft landed on asteroid (433) Eros after orbiting 
it for one year. 

The images of asteroids (Fig. 8.35) show ir¬ 
regular, crater-filled bodies with regolith and pul¬ 
verised rock on their surface. Some asteroids may 
once have been two separate objects that merged 
into one. In 1992 asteroid (4179) Toutatis passed 
the Earth only by 4 million kilometres. Radar im¬ 
ages revealed a two-body system, where the com¬ 
ponents were touching each other. Double aster¬ 
oids may be quite common, and there exist light 
curves of some asteroids which have been inter¬ 
preted as results of twin bodies. Another example 
of a twin asteroid is 243 Ida that has a “moon”, 
a smaller body gravitationally bound to it. 

The composition of main belt asteroids is sim¬ 
ilar to that of iron, stone and iron-stone mete¬ 
orites. Most asteroids can be divided into three 
groups, according to their photometric and po- 
larimetric properties. 95 % of the classified aster¬ 
oids belong to the types C and S types. Metal-rich 
M type asteroids are rarer. 

About 75 percent of asteroids belong to the 
type C type. The C asteroids are dark due to 
radiation darkening (geometric albedo p & 0.06 
or less), and they contain a considerable amount 
of carbon (mnemonic C for carbon). They re¬ 
semble stony meteorites. The material is undif¬ 


ferentiated and thus they belong to the most pri¬ 
mordial bodies of the solar system. The reflec¬ 
tivity of silicate-rich S asteroids is higher and 
their spectra are close to those of stone-iron me¬ 
teorites. Their spectra show signs of silicates, 
such as olivine, e.g. fosterite Mg 2 SiC >4 or fay- 
alite FeiSiOr. M type asteroids have more met¬ 
als, mostly nickel and iron; they have undergone 
at least a partial differentiation. 

The compositions and even sizes of the Trans- 
Neptunian objects are difficult to determine. They 
are dim, and due to their low temperature, the 
black-body radiation maximum is around 60 pm. 
This wavelength is almost impossible to observe 
on the Earth. Even the estimations of the albedos, 
and therefore the diameter are very uncertain. 

Colours of TNOs range from blue-grey to red 
and the distribution appears to be uniform. How¬ 
ever, population of the low-inclination objects 
seem to be red and high-inclination objects blue. 
The unperturbed orbits of the low-inclination ob¬ 
jects suggest that they represent a relic of the 
original population of the Kuiper belt. 

Interpretations of the spectra are ambiguous 
and spectra may not describe the composition of 
the whole object. The surface is altered by in¬ 
tense radiation, solar wind and micrometeorites 
and it can be quite different from the regolith and 
deeper layers underneath. 

Small TNOs are probably mixtures of rock 
and ice with some organic surface material. The 
composition is similar to the comets. High den¬ 
sity (2000-3000 kgm -3 ) of some large objects 
suggests a high non-ice content, similar to Pluto. 


8.12 Comets 

Comets are agglomerates of ice, snow, and dust; 
a typical diameter is of the order of 10 km or less. 
The nucleus contains icy chunks and frozen gases 
with embedded rock and dust. At its centre, there 
can be a small, rocky core. 

Comets are the only celestial objects named 
after their finders. The letter P (periodic) preced¬ 
ing some names indicates that planetary perturba¬ 
tions have changed the orbit so that the comet has 
remained orbiting the Sun. 
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A comet is invisible when far from the Sun; 
when it gets closer than about 2 au. the heat of 
the Sun starts to melt the ice and snow. The out¬ 
flowing gas and dust form an envelope, the coma 
around the nucleus. Radiation pressure and the 
solar wind push ionised gas and dust away from 
the Sun, resulting in the typical long-tailed shape 
of a comet (Fig. 8.42). 

The tail is always pointing away from the Sun, 
a fact which was noticed in the 16th century. Usu¬ 
ally, there are two tails, an ion tail (gas tail) and 
a dust tail. The partly ionised gas and very fine 
dust in the ion tail are driven by the solar wind. 
Some of the light is reflected solar light, but the 
brightness of the ion tail is mostly due to emis¬ 
sion by the excited atoms. The dust tail is caused 
by the radiation pressure. Because the velocities 
of the particles of the dust tail are lower than the 
velocities in the ion tail, the dust tail is often more 
curved than the ion tail. 

Fred Whipple introduced in 1950’s a “dirty 
snowball ” theory to describe the cometary struc¬ 
ture. According to this model, cometary nuclei 



Fig. 8.41 Lower right: A composite image of the nu¬ 
cleus of comet P/Halley taken by ESA Giotto spacecraft in 
1986. The size of the nucleus is approximately 13x7 km. 
Dust jets are originating from two regions on the nucleus. 
(ESA/Max Planck Institut ftir Aeronomie) 


are composed of ice mixed with gravel and dust. 
The observations have revealed that the classi¬ 
cal dirty snowball model is not quite accurate; 
at least the surface is more dirt than snow, also 
containing organic compounds. Several chemical 
compounds have been observed, including water 
ice, which probably makes up 75-80 % of the 
volatile material. Other common compounds are 
carbon monoxide (CO), carbon dioxide (CO 2 ), 
methane (CH 4 ), ammonia (NH 3 ), and formalde¬ 
hyde (H 2 CO). 

The most famous (and also best known) pe¬ 
riodic comet is Halley’s comet (Fig. 8.42). Its or¬ 
bital period is about 76 years; it was last in perihe¬ 
lion in 1986. During the last apparition, the comet 
was also observed by spacecraft, revealing the 
solid cometary body itself for the first time. Hal¬ 
ley is a 13 x 7 km, peanut-shaped chunk whose 
surface is covered by an extremely black layer of 
a possibly tar-like organic or other similar mate¬ 
rial. Violent outbursts of gas and dust make an ex¬ 
act prediction of its brightness impossible, as of¬ 
ten noticed when cometary magnitudes have been 
predicted. Near the perihelion, several tons of gas 
and dust burst out every second. 

Cometary material is very loose. Ablation of 
gas and dust, large temperature variations and 
tidal forces sometimes cause the whole comet to 
break apart. Comet Shoemaker-Levy 9 which im¬ 
pacted into Jupiter in 1994 was torn apart two 
years earlier when it passed Jupiter at a dis¬ 
tance of 21,000 km (Fig. 8.44). The impact of 
Shoemaker-Levy 9 showed that there can be den¬ 
sity variation (and perhaps variation in composi¬ 
tion, too) inside the original cometary body. 

Comets are rather ephemeral things, surviving 
only a few thousand revolutions around the Sun 
or less. The short-period comets are all newcom¬ 
ers and can survive only a short time here, in the 
central part of the solar system. 

Since comets in the central solar system are 
rapidly destroyed, there has to be some source 
of new short-period comets. In 1950 Jan Oort 
discovered a strong peak for aphelia of long pe¬ 
riod comets at a distance of about 50,000 au, and 
that there is no preferential direction from which 
comets come (Fig. 8.43). He proposed that there 
is a vast cloud of comets at the outer reaches of 
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Fig. 8.42 Top: Comet Mrkos in 1957. The tail typical to 
comets is caused by the solar wind and radiation pressure 
when the comet approaches the Sun (Palomar Observa- 

the solar system, now know as the Oort cloud 
(Fig. 8.45). The total mass of the Oort cloud is 
estimated to be tens of Earth masses, containing 
more than 10 12 comets. 

A year later Gerard Kuiper showed that there 
is a separate population of comets. Many of 
the short period comets, with periods less than 
200 years, have the orbital inclination less than 
40°, and they orbit the Sun in the same direction 
as the Earth. The orbital inclination of long period 
comets are not peaked around the plane of the 
ecliptic but they are more random. Kuiper argued 


tory). Down: The comet Churyumov-Gerasimenko is the 
first comet where a space probe has landed and sent de¬ 
tailed closeup images. (ESA) 

that the short period comets originate from a sep¬ 
arate population of comets that resides in a disk¬ 
like cloud beyond the orbit of Neptune. The area 
is now known as the Kuiper belt (Fig. 8.39). 

Occasionally perturbations from passing stars 
send some of the comets in the Oort cloud into or¬ 
bits, which bring them into the central parts of the 
solar system, where they are seen as long-period 
comets. Around a dozen “new” comets are dis¬ 
covered each year. Most of these are visible only 
with a telescope, and only a couple of times per 
decade one can see a bright naked-eye comet. 
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Some of the long period comets are put into 
short period orbits by the perturbations of Jupiter 
and Saturn, whereas some others can be ejected 
from the solar system. However, there are no 
comets that have been proven to come from inter¬ 
stellar space, and the relative abundances of sev¬ 
eral isotopes in the cometary matter are the same 
as in other bodies of our solar system. 

The origin of the Oort cloud and Kuiper belt 
is different. The Oort cloud objects were formed 
near the giant planets and have been ejected to 
the outer edge of the solar system by gravitational 
perturbations soon after the formation of the solar 
system. Small objects beyond the orbit of Nep- 



Fig. 8.43 Orbits of short period comets projected to the 
plane of the ecliptic 


tune had no such interactions and they remained 
near the accretion disk. 

In 2014 ESA’s Rosetta probe settled on an or¬ 
bit around the comet Churyumov-Gerasimenko 
and sent the Philae lander to its surface. The lan¬ 
der found several organic compounds and hard 
ice below a dust layer about 20 centimetres thick. 
The water vapour emanating from the comet was 
different from the terrestrial water: the ratio of 
deuterium to ordinary hydrogen was three times 
higher than on the Earth. This suggests that the 
source of water on the Earth is not comets, at least 
not of those like Churyumov-Gerasimenko. 


8.13 Meteoroids 

Solid bodies smaller than asteroids are called me¬ 
teoroids. The boundary between asteroids and 
meteoroids, however, is diffuse; it is a matter of 
taste whether a ten metre body is called an as¬ 
teroid or a meteoroid. We could say that it is an 
asteroid if it has been observed so often that its 
orbital elements are known. 

When a meteoroid hits the atmosphere, an 
optical phenomenon, called a meteor (“shooting 
star”) is seen (Fig. 8.47). The smallest bodies 
causing meteors have a mass of about 1 gram; the 
(micro)meteoroids smaller than this do not result 
in optical phenomena. However, even these can 
be observed with radar which is able to detect 
the column of ionised air. Micrometeoroids can 



Fig. 8.44 Comet Shoemaker-Levy 9 five months before its collision to Jupiter as seen by the Hubble Space Telescope. 
(JPL/NASA) 
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Fig. 8.45 A schematic diagram of the distribution of the 
semimajor axes of long-period comets. The abscissa is 
the inverse of the semimajor axis, 1/a [AU] -1 . The Oort 
cloud is visible as a strong peak at the very small positive 
values of 1/a. The orbits shown here are the "original or¬ 
bits”, i.e. computed backward in time to remove all known 
perturbations 

also be studied with particle detectors installed 
in satellites and space crafts. Bright meteors are 
called bolides. 

The number of meteoroids increases rapidly as 
their size diminishes. It has been estimated that 
at least 10 5 kg of meteoritic material falls on the 
Earth each day. Most of this material is microm¬ 
eteoroids and causes no visible phenomena. 

Due to perspective, all meteors coming from 
the same direction seem to radiate from the same 
point. Such meteor streams (meteor showers) are, 
e.g. the Perseids in August and the Geminides in 
December; the names are given according to the 
constellation in which the radiation point seems 
to be. On the average, one can see a few sporadic 
meteors (Fig. 8.46) per hour. During a strong me¬ 
teor shower one can see even tens of meteors per 
minute, although a normal rate is some tens per 
hour. 

Most of the meteoroids are small and burn to 
ashes at a height of 100 km. However, larger bod¬ 
ies may come through and fall to the Earth. These 
are called meteorites. The relative speed of a typi¬ 
cal meteoroid varies in the range 10-70 km/s. The 
speed of the largest bodies does not diminish in 
the atmosphere; thus, they hit the Earth at their 
cosmic speeds, resulting in large impact craters. 
Smaller bodies slow down and drop like stones 



Fig. 8.46 When the Earth is moving on its orbit, the 
morning hemisphere is the leading one. Hence most of the 
meteoroids coming from random directions hit the morn¬ 
ing hemisphere. Thus most meteors can be seen between 
midnight and sunrise 



Fig. 8.47 Meteors are easy to capture on film: one just 
leaves a camera loaded with a sensitive film on a tripod 
with the shutter open for an hour or so. Stars make curved 
trails on the film. (L. Hakkinen) 

but impacts of large bodies (diameter meters or 
more) may cause large-scale disaster. 

Iron meteorites or irons, composed of almost 
pure nickel-iron, comprise about one quarter of 
all meteorites. Actually the irons are in a minor¬ 
ity among meteoroids, but they survive their vio¬ 
lent voyage through the atmosphere more easily 
than weaker bodies. Three-quarters are stony me¬ 
teorites, or stone-iron meteorites. 
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Fig. 8.48 A projection of 
the entire infrared sky 
created from observations 
of the COBE satellite. The 
bright horizontal band is 
the Milky Way. The dust of 
the solar system, visible on 
the Earth as zodiacal light 
is an S-shaped glow across 
the image. (G. Greaney and 
NASA) 



Meteoroids themselves can be divided into 
three groups of roughly equal size. One-third 
is ordinary stones, chondrites. The second class 
contains weaker carbonaceous chondrites and 
the third class includes cometary material, loose 
bodies of ice and snow which are unable to sur¬ 
vive down to the Earth. 

Many meteor streams are in the same orbit as 
a known comet, so at least some meteoroids are of 
cometary origin. Near a perihelion passage, every 
second several tons of gravel is left on the orbit of 
a comet. There are several examples of meteorites 
that have their origin in the Moon or Mars. De¬ 
bris of large impacts may have been ejected into 
space and finally ended up on the Earth. Some 
meteoroids are debris of asteroids. 


8.14 Interplanetary Dust and Other 
Particles 

Two faint light phenomena, namely zodiacal light 
and gegenschein (counterglow) make it possible 
to observe interplanetary dust, small dust parti¬ 
cles reflecting the light of the Sun. This weak 
glow can be seen above the rising or setting 
Sun (zodiacal light) or exactly opposite the Sun 
(gegenschein). The interplanetary dust is concen¬ 
trated near the plane of the ecliptic. The typ¬ 
ical sizes of the particles are in the range of 
10-100 pm. 

Solar Wind Elementary particles hitting the 
Earth originate both in the Sun and outside the 
solar system. Charged particles, mainly protons, 


electrons and alpha particles (helium nuclei) flow 
continuously out of the Sun. At the distance of 
the Earth, the average speed of this solar wind is 
about 500 krn/s. The velocity and number of par¬ 
ticles depend on eruptions and activity of the Sun 
(Sects. 13.3 and 13.4). 

The charged particles interact with the solar 
magnetic field. The strength of the solar magnetic 
field at the Earth’s distance is about 1/1000 of 
that of the Earth. Particles coming from outside 
the solar system are called cosmic rays, which 
contain both elementary particles and heavier nu¬ 
clei. 


8.15 Examples 

Example 8.1 (The Roche Limit) A French math¬ 
ematician Edouard Roche computed in 1848 
a limit where a moon will be torn apart due to 
the tidal forces if it approaches its parent planet. 
Roche proposed that the Saturnian rings were 
formed in that way. 



We can compute the Roche limit for a body at 
a distance R of a planet the mass of which is M 
approximating the body with two small spheres 
of radius r and mass m. The difference of grav¬ 
itation affecting the small spheres by the planet 
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is 


A F = GMm 

»r 


1 


1 


L(fl-r) 2 (R + r) 2 


4 r 

GMm — T . 
R 3 


The gravitational force between the small spheres 
is 

Gm 2 


F' = 


4 r 2 


If A F > F', the small spheres will be pulled 
apart. The forces are equal at the Roche limit: 

4r Gm 2 
GMm — — . 

R 3 4 r 2 

Thus the distance of the Roche limit R is 


R — 


16 r 3 M 


Inserting the masses of the planet and the spheres 
in terms of the radii of the planet S and spheres r, 
and assuming that the densities p are equal, m = 
j 7 r/- 3 p, M — p, we obtain 

R& 2.5 x S. 


Our result is valid only for a body without any 
internal strength. Smaller bodies with internal 
strength can survive inside the Roche limit. You, 
dear reader, act as an excellent example of this, 
because you read this example well inside the 
Roche limit of the Earth. A 100 km stony asteroid 
will survive even if it orbits the Earth just above 
the atmosphere but a sphere of water of the same 
size would break apart. 


8.16 Exercises 

Exercise 8.1 The interval between two opposi¬ 
tions of a planet was 398.9 d. The angular di¬ 


ameter of the planet at the opposition was 47.2". 
Find the sidereal period, semimajor axis, and the 
true diameter (in kilometres) of the planet. Which 
planet was it? 

Exercise 8.2 (a) Assume that three bodies move 
along circular orbits with angular velocities (mean 
motions) n%, ;?2 and M 3 . Show that these bodies 
have a common synodic period if and only if there 
are nonzero integers k \, A :2 and A 3 such that 

k\n\ + & 2«2 + A 3 M 3 = 0 , k 1 + A 2 + A 3 = 0 . 

(b) The resonance of the Galilean satellites can 
be expressed in terms of their mean motions as 

«Io 3Mg ur0 p a + — M ci an y me tie — 0. 

Find the synodic period of these three moons. 

Exercise 8.3 The rotation period of Mars is 
24.62 hours and the orbital period of Deimos 
30.30 hours. How long is the time between two 
successive risings of Deimos and how much 
Deimos moves in one hour as seen from Mars? 

Exercise 8.4 Find the distance of the Roche 
limit for all planets and see if any of their moons 
is orbiting the limit. And how are the rings of gi¬ 
ant planets situated relative to the Roche limit? 

Exercise 8.5 Assume that the hydrogen atom is 
a solid sphere with a radius of 5.3 x 10 “ 11 m (the 
Bohr radius) and mass 1.67 x 10 -27 kg. Use such 
balls to build a regular (infinite) grid where each 
ball is touching its six neighbours. What is the 
density of such matter? Compare with the density 
of Jupiter. 
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All our information about the physical proper¬ 
ties of stars comes more or less directly from 
studies of their spectra. In particular, by study¬ 
ing the strength of various absorption lines, stel¬ 
lar masses, temperatures and compositions can be 
deduced. The line shapes contain detailed infor¬ 
mation about atmospheric processes. 

As we have seen in Chap. 3, the light of a star 
can be dispersed into a spectrum by means of 
a prism or a diffraction grating. The distribution 
of the energy flux density over frequency can then 
be derived. The spectra of stars consist of a con¬ 
tinuous spectrum or continuum with narrow spec¬ 
tral lines superimposed (Fig. 9.1). The lines in 
stellar spectra are mostly dark absorption lines, 
but in some objects bright emission lines also oc¬ 
cur. 

In a very simplified way the continuous spec¬ 
trum can be thought of as coming from the hot 
surface of the star. Atoms in the atmosphere 
above the surface absorb certain characteristic 
wavelengths of this radiation, leaving dark “gaps” 
at the corresponding points in the spectrum. In 
reality there is no such sharp separation between 
surface and atmosphere. All layers emit and ab¬ 
sorb radiation, but the net result of these pro¬ 
cesses is that less energy is radiated at the wave¬ 
lengths of the absorption lines. 

The spectra of stars are classified on the ba¬ 
sis of the strengths of the spectral lines. Isaac 
Newton observed the solar spectrum in 1666, but, 
properly speaking, spectroscopy began in 1814 
when Joseph Fraunhofer observed the dark lines 
in the spectrum of the Sun. He assigned capi¬ 


tal letters, like D, G, H and K, to some of the 
stronger dark lines without knowing the elements 
responsible for the origin of the lines (Sect. 9.2). 
The absorption lines are also known as Fraun¬ 
hofer lines. In 1860, Gustav Robert Kirchhoff and 
Robert Bunsen identified the lines as the charac¬ 
teristic lines produced by various elements in an 
incandescent gas. 


9.1 Measuring Spectra 

The most important methods of forming a spec¬ 
trum are by means of an objective prism or a slit 
spectrograph. In the former case one obtains 
a photograph, where each stellar image has been 
spread into a spectrum. Up to several hundred 
spectra can be photographed on a single plate 
and used for spectral classification. The amount 
of detail that can be seen in a spectrum depends 
on its dispersion , the range of wavelengths per 
millimetre on the plate (or per pixel on a CCD). 
The dispersion of an objective prism is a few 
tens of nanometres per millimetre. More detailed 
observations require a slit spectrograph, which 
can reach a dispersion 1-0.01 nm/rnm. The de¬ 
tailed shape of individual spectral lines can then 
be studied. 

The photograph of the spectrum is converted 
to an intensity tracing showing the flux density as 
a function of wavelength. This is done by means 
of a microdensitometer, measuring the amount of 
light transmitted by the recorded spectrum. Since 
the blackening of a photographic plate is not lin¬ 
early related to the amount of energy it has re- 
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Fig. 9.1 A typical stellar spectrum. The continuous spec¬ 
trum is brightest at about 550 nm and gets fainter towards 
shorter and longer wavelengths. Dark absorption lines are 


superimposed on the continuum. The spectrum of the star, 
rj Pegasi, is very similar to that of our Sun. (Mt. Wilson 
Observatory) 




Fig. 9.2 (a) A section of a photograph of a stellar spec¬ 
trum and the corresponding rectified microdensitometer 
intensity tracing. The original spectrum was taken at the 
Crimean Observatory, (b) A more extensive part of the 

ceived, the measured blackening has to be cali¬ 
brated by comparison with known exposures. 

Nowadays CCD cameras are used also in spec¬ 
trographs and the intensity curve is determined 
directly without the intervening step of a pho¬ 
tographic plate. Some image processing is still 
needed since the height of the spectrum is sev¬ 
eral pixels and the spectrum may not be aligned 
with the pixel rows of the camera. 

For measurements of line strengths the spec¬ 
trum is usually rectified by dividing by the con¬ 
tinuum intensity. Figure 9.2 shows a photograph 
of the spectrum of a star and the intensity curve 
obtained from a calibrated and rectified micro¬ 


spectrum. (c) The picture the intensity curve of the first 
picture has been rectified by normalising the value of the 
continuum intensity to one. (Pictures by J. Kyrolainen and 
H. Virtanen, Helsinki Observatory) 

densitometer tracing. The second pair of pic¬ 
tures shows the intensity curve before and after 
the normalisation. The absorption lines appear as 
troughs of various sizes in the curve. In addition 
to the clear and deep lines, there are large num¬ 
bers of weaker lines that can barely be discerned. 
The graininess of the photographic emulsion is 
a source of noise which appears as irregular fluc¬ 
tuations of the intensity curve. Some lines are so 
close together that they appear blended at this dis¬ 
persion. 

The detailed shape of a spectral line is called 
the line profile (Sect. 5.3). The true shape of the 
line reflects the properties of the stellar atmo- 
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sphere, but the observed profile is also spread out 
by the measuring instrument. However, the total 
absorption in the line, usually expressed in terms 
of the equivalent width, is less sensitive to obser¬ 
vational effects (see Fig. 5.6). 

The equivalent width of a spectral line de¬ 
pends on how many atoms in the atmosphere are 
in a state in which they can absorb the wave¬ 
length in question. The more atoms there are, the 
stronger and broader the spectral line is. For ex¬ 
ample, a typical equivalent width of a metal line 
(Fe) in the solar spectrum is about 10 pm. Line 
widths are often expressed in angstroms (1 A = 
10 -10 m = 0.1 nm). 

Only in weak lines the equivalent width de¬ 
pends linearly on the number of absorbing atoms. 
The equivalent width as a function of the amount 
of absorbing atoms is known as the curve of 
growth. It is, however, beyond the scope of this 
book. 

Line profiles are also broadened by the Dopp¬ 
ler effect. In stellar atmospheres there are motions 
of small and large scale, like thermal motion of 
the atoms and convective flows. 

The chemical composition of the atmosphere 
can be determined from the strengths of the spec¬ 
tral lines. With the introduction of large com¬ 
puters it has become feasible to construct quite 
detailed models of the structure of stellar atmo¬ 
spheres, and to compute the emergent spectrum 
for a given model. The computed synthetic spec¬ 
trum can be compared with the observations and 
the theoretical model modified until a good fit 
is obtained. The theoretical models then give the 
number of absorbing atoms, and hence the ele¬ 
ment abundances, in the atmosphere. The con¬ 
struction of model atmospheres will be discussed 
in Sect. 9.6. 


9.2 The Harvard Spectral 
Classification 

The spectral classification scheme in present use 
was developed at Harvard Observatory in the 
United States in the early 20th century. The work 
was begun by Henry Draper who in 1872 took 
the first photograph of the spectrum of Vega. 
Later Draper’s widow donated the observing 


equipment and a sum of money to Harvard Ob¬ 
servatory to continue the work of classification. 

The main part of the classification was done by 
Annie Jump Cannon using objective prism spec¬ 
tra. The Henry Draper Catalogue (HD) was pub¬ 
lished in 1918-1924. It contains 225,000 stars ex¬ 
tending down to 9 magnitudes. Altogether more 
than 390,000 stars were classified at Harvard. 

The Harvard classification is based on lines 
that are mainly sensitive to the stellar tempera¬ 
ture, rather than to gravity or luminosity. Impor¬ 
tant lines are the hydrogen Balmer lines, the lines 
of neutral helium, the iron lines, the H and K dou¬ 
blet of ionised calcium at 396.8 and 393.3 nm, 
the G band due to the CH molecule and some 
metals around 431 nm, the neutral calcium line at 
422.7 nm and the lines of titanium oxide (TiO). 

The main types in the Harvard classification 
are denoted by capital letters. They were ini¬ 
tially ordered in alphabetical sequence, but subse¬ 
quently it was noticed that they could be ordered 
according to temperature. With the temperature 
decreasing towards the right the sequence is 

C 

O-B-A-F-G-K-M-L-T. 

S 

Additional notations are Q for novae, P for plan¬ 
etary nebulae and W for Wolf-Rayet stars. The 
class C consists of the earlier types R and N. 
The spectral classes C and S represent parallel 
branches to types G-M, differing in their surface 
chemical composition. The most recent addition 
are the spectral classes L and T continuing the 
sequence beyond M, representing brown dwarfs. 
There is a well-known mnemonic for the spectral 
classes, but due to its chauvinistic tone we refuse 
to tell it. 

The spectral classes are divided into sub¬ 
classes denoted by the numbers 0,..., 9; some¬ 
times decimals are used, e.g. B 0.5 (Figs. 9.3 
and 9.4). 

Classes at the beginning of the sequence are 
sometimes called early classes and those at the 
end of the sequence late classes. This is no way 
related to stellar evolution; it only reflects the po¬ 
sition in the sequence O-B-A-F-G-K-M. 
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Fig. 9.3 Spectra of early and late spectral type stars be¬ 
tween 375 and 390 nm. (a) The upper star is Vega, of 
spectral type AO, and (b) the lower one is Aldebaran, of 

Spectra of brown dwarfs are shown in Fig. 9.4 
and compared with those of M dwarfs. 

The main characteristics of the different classes 
are: 

O Blue stars, surface temperature 20,000- 
35,000 K. Spectrum with lines from multiply 
ionised atoms, e.g. Hell, CIII, NIII, OIII, 
Si V. He I visible, HI lines weak. 

B Blue-white stars, surface temperature about 
15,000 K. Hell lines have disappeared. He I 
(403 nm) lines are strongest at B2, then get 
weaker and have disappeared at type B9. The 
K line of Call becomes visible at type B3. 
H I lines getting stronger. OII, Si II and Mg II 
lines visible. 

A White stars, surface temperature about 
9000 K. The HI lines are very strong at 
A0 and dominate the whole spectrum, then 
get weaker. H and K lines of Call getting 
stronger. He I no longer visible. Neutral metal 
lines begin to appear. 

F Yellow-white stars, surface temperature about 
7000 K. HI lines getting weaker, H and K of 
Ca II getting stronger. Many other metal lines, 
e.g. Fel, Fell, CrII, Till, clear and getting 
stronger. 

G Yellow stars like the Sun, surface tempera¬ 
ture about 5500 K. The HI lines still getting 
weaker, H and K lines very strong, strongest 
at GO. Metal lines getting stronger. G band 
clearly visible. CN lines seen in giant stars. 

K Orange-yellow stars, surface temperature 
about 4000 K. Spectrum dominated by metal 
lines. HI lines insignificant. Cal 422.7 nm 
clearly visible. Strong H and K lines and 
G band. TiO bands become visible at K5. 


spectral type K5. The hydrogen Balmer lines are strong in 
the spectrum of Vega; in that of Aldebaran, there are many 
metal lines. (Lick Observatory) 

M Red stars, surface temperature about 3000 K. 
TiO bands getting stronger. Cal 422.7 nm 
very strong. Many neutral metal lines. 

L Brown (actually dark red) stars, surface tem¬ 
perature about 2000 K. The TiO and VO 
bands disappear for early L class. Very strong 
and broad lines of Na I and KI. 

T Brown dwarfs, surface temperature about 
1000 K. Very strong molecular absorption 
bands of CH 4 and H 2 O. 

C Carbon stars, previously R and N. Very red 
stars, surface temperature about 3000 K. 
Strong molecular bands, e.g. C 2 , CN and CH. 
No TiO bands. Line spectrum like in the types 
K and M. 

S Red low-temperature stars (about 3000 K). 
Very clear ZrO bands. Also other molecular 
bands, e.g. YO, LaO and TiO. 

The main characteristics of the classification 
scheme can be seen in Fig. 9.5 showing the vari¬ 
ations of some typical absorption lines in the dif¬ 
ferent spectral classes. Different spectral features 
are mainly due to different effective temperatures. 
Different pressures and chemical compositions of 
stellar atmospheres are not very important factors 
in the spectral classification, except in some pe¬ 
culiar stars. 

The early, i.e. hot, spectral classes are char¬ 
acterised by the lines of ionised atoms, whereas 
the cool, or late, spectral types have lines of neu¬ 
tral atoms. In hot stars molecules dissociate into 
atoms; thus the absorption bands of molecules ap¬ 
pear only in the spectra of cool stars of late spec¬ 
tral types. 

To see how the strengths of the spectral lines 
are determined by the temperature, we consider. 
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Fig. 9.4 (a) Intensity curves for various spectral classes 
showing characteristic spectral features. The name of the 
star and its spectral and luminosity class are given next 
to each curve, and the most important spectral features 
are identified. (Drawing by J. Dufay.) (b) Optical spec- 

for example, the neutral helium lines at 402.6 nm 
and 447.2 nm. These are only seen in the spec¬ 
tra of hot stars. The reason for this is that the 
lines are due to absorption by excited atoms, 


tra of M stars and brown dwarfs. In an approximate sense, 
the brown dwarfs continue the spectral sequence towards 
lower temperatures, although in many respects they dif¬ 
fer from the earlier spectral types. (J.D. Kirkpatrick 2005, 
ARAA 43, 205) 

and that a high temperature is required to pro¬ 
duce any appreciable excitation. As the stellar 
temperature increases, more atoms are in the re¬ 
quired excited state, and the strength of the he- 
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Fig. 9.5 Equivalent 
widths of some important 
spectral lines in the various 
spectral classes. [Struve, O. 
(1959): Elementary 
Astronomy (Oxford 
University Press, New 
York) p. 259] 


Temperature 

25.000 10,000 6.000 3.600 



lium lines increases. When the temperature be¬ 
comes even higher, helium begins to be ionised, 
and the strength of the neutral helium lines be¬ 
gins to decrease. In a similar way one can under¬ 
stand the variation with temperature of other im¬ 
portant lines, such as the calcium H and K lines. 
These lines are due to singly ionised calcium, and 
the temperature must be just right to remove one 
electron but no more. 

The hydrogen Balmer lines H^, H y and are 
strongest in the spectral class A2. These lines cor¬ 
respond to transitions to the level the principal 
quantum number of which is n — 2. If the tem¬ 
perature is too high the hydrogen is ionised and 
such transitions are not possible. 


9.3 The Yerkes Spectral 
Classification 

The Harvard classification only takes into ac¬ 
count the effect of the temperature on the spec¬ 
trum. For a more precise classification, one also 
has to take into account the luminosity of the star, 
since two stars with the same effective tempera¬ 
ture may have widely different luminosities. 

A two-dimensional system of spectral classi¬ 
fication was introduced by William W. Morgan, 
Philip C. Keenan and Edith Kellman of Yerkes 
Observatory. This system is known as the MKK 
or Yerkes classification. (The MK classification is 
a modified, later version.) The MKK classifica¬ 
tion is based on the visual scrutiny of slit spec¬ 
tra with a dispersion of 11.5 nm/mm. It is care¬ 
fully defined on the basis of standard stars and the 


specification of luminosity criteria. Six different 
luminosity classes are distinguished: 

- Ia most luminous supergiants, 

- Ib less luminous supergiants, 

- II luminous giants, 

- Ill normal giants, 

- IV subgiants, 

- V main sequence stars (dwarfs). 

The luminosity class is denoted by a Roman 
numeral after the spectral class. For example, the 
class of the Sun is G2 V. 

The luminosity class is determined from spec¬ 
tral lines that depend strongly on the stellar sur¬ 
face gravity, which is closely related to the lu¬ 
minosity. The masses of giants and dwarfs are 
roughly similar, but the radii of giants are much 
larger than those of dwarfs. Therefore the grav¬ 
itational acceleration g — GM/R 2 at the surface 
of a giant is much smaller than for a dwarf. In 
consequence, the gas density and pressure in the 
atmosphere of a giant is much smaller. This gives 
rise to luminosity effects in the stellar spectrum, 
which can be used to distinguish between stars of 
different luminosities. 

1. For spectral types B-F, the lines of neutral hy¬ 
drogen are deeper and narrower for stars of 
higher luminosities. The reason for this is that 
the metal ions give rise to a fluctuating elec¬ 
tric field near the hydrogen atoms. This field 
leads to shifts in the hydrogen energy levels 
(the Stark effect), appearing as a broadening 
of the lines. The effect becomes stronger as 
the density increases. Thus the hydrogen lines 
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Fig. 9.6 Luminosity 
effects in the hydrogen H y 
line in A stars. The vertical 
axis gives the normalised 
intensity. HD 223385 
{upper left ) is an 
A2 supergiant, where the 
line is very weak, 

9 Aurigae A is a giant star 
and a 2 Geminorum is 
a main sequence star, 
where the line is very 
broad. [Aller, L.H. (1953): 
Astrophysics. The 
Atmospheres of the Sun 
and Stars (The Ronald 
Press Company, New York) 
p. 318] 



are narrow in absolutely bright stars, and be¬ 
come broader in main sequence stars and even 
more so in white dwarfs (Fig. 9.6). 

2. The lines from ionised elements are relatively 
stronger in high-luminosity stars. This is be¬ 
cause the higher density makes it easier for 
electrons and ions to recombine to neutral 
atoms. On the other hand, the rate of ionisa¬ 
tion is essentially determined by the radiation 
field, and is not appreciably affected by the gas 
density. Thus a given radiation field can main¬ 
tain a higher degree of ionisation in stars with 
more extended atmospheres. For example, in 
the spectral classes F-G, the relative strengths 
of the ionised strontium (Sr II) and neutral iron 
(Fe I) lines can be used as a luminosity indica¬ 


tor. Both lines depend on the temperature in 
roughly the same way, but the Sr II lines be¬ 
come relatively much stronger as the luminos¬ 
ity increases. 

3. Giant stars are redder than dwarfs of the same 
spectral type. The spectral type is determined 
from the strengths of spectral lines, including 
ion lines. Since these are stronger in giants, 
a giant will be cooler, and thus also redder, 
than a dwarf of the same spectral type. 

4. There is a strong cyanogen (CN) absorption 
band in the spectra of giant stars, which is al¬ 
most totally absent in dwarfs. This is partly 
a temperature effect, since the cooler atmo¬ 
spheres of giants are more suitable for the for¬ 
mation of cyanogen. 
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Fig. 9.7 Peculiar spectra, (a) RGeminorum (above) is an 
emission line star, with bright emission lines, indicated by 
arrows, in its spectrum; (b) the spectrum of a normal star 


9.4 Peculiar Spectra 

The spectra of some stars differ from what one 
would expect on the basis of their temperature 
and luminosity (see, e.g., Fig. 9.7). Such stars are 
celled peculiar. The most common peculiar spec¬ 
tral types will now be considered. 

The Wolf-Rayet stars are very hot stars; the 
first examples were discovered by Charles Wolf 
and Georges Rayet in 1867. The spectra of Wolf- 
Rayet stars have broad emission lines of hydro¬ 
gen and ionised helium, carbon, nitrogen and 
oxygen. There are hardly any absorption lines. 
The Wolf-Rayet stars are thought to be very 
massive stars that have lost their outer layers in 
a strong stellar wind. This has exposed the stellar 
interior, which gives rise to a different spectrum 
than the normal outer layers. Many Wolf-Rayet 
stars are members of binary systems. 

In some O and B stars the hydrogen absorp¬ 
tion lines have weak emission components ei¬ 
ther at the line centre or in its wings. These stars 
are called Be and shell stars (the letter e after 
the spectral type indicates that there are emis¬ 
sion lines in the spectrum). The emission lines 
are formed in a rotationally flattened gas shell 
around the star. The shell and Be stars show ir¬ 
regular variations, apparently related to structural 
changes in the shell. About 15 % of all O and 
B stars have emission lines in their spectra. 


is compared with one in which the zirconium lines are un¬ 
usually strong. (Mt. Wilson Observatory and Helsinki Ob¬ 
servatory) 

The strongest emission lines are those of the 
P Cygni stars, which have one or more sharp ab¬ 
sorption lines on the short wavelength side of 
the emission line. It is thought that the lines 
are formed in a thick expanding envelope. The 
P Cygni stars are often variable. For example, 
P Cygni itself has varied between three and six 
magnitudes during the past centuries. At present 
its magnitude is about 5. 

The peculiar A stars or Ap stars (p = peculiar) 
are usually strongly magnetic stars, where the 
lines are split into several components by the 
Zeeman effect. The lines of certain elements, 
such as magnesium, silicon, europium, chromium 
and strontium, are exceptionally strong in the Ap 
stars. Lines of rarer elements such as mercury, 
gallium or krypton may also be present. Other¬ 
wise, the Ap stars are like normal main sequence 
stars. 

The Am stars (m = metallic) also have anoma¬ 
lous element abundances, but not to the same ex¬ 
tent as the Ap stars. The lines of e.g. the rare 
earths and the heaviest elements are strong in 
their spectra; those of calcium and scandium are 
weak. 

We have already mentioned the S and C stars, 
which are special classes of K and M giants with 
anomalous element abundances. In the S stars, 
the normal lines of titanium, scandium and vana¬ 
dium oxide are replaced with oxides of heavier 
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Fig. 9.8 The Hertzsprung-Russell diagram. The horizon¬ 
tal coordinate can be either the colour index B — V, ob¬ 
tained directly from observations, or the spectral class. In 
theoretical studies the effective temperature T e is com¬ 
monly used. These correspond to each other but the depen¬ 
dence varies somewhat with luminosity. The vertical axis 
gives the absolute magnitude. In a (lg(L/Lg), lg T e ) plot 
the curves of constant radius are straight lines. The dens- 

elements, zirconium, yttrium and barium. A large 
fraction of the S stars are irregular variables. The 
name of the C stars refers to carbon. The metal 
oxide lines are almost completely absent in their 
spectra; instead, various carbon compounds (CN, 
C 2 , CH) are strong. The abundance of carbon rel¬ 
ative to oxygen is 4-5 times greater in the C stars 
than in normal stars. The C stars are divided into 
two groups, hotter R stars and cooler N stars. 

Another type of giant stars with abundance 
anomalies are the barium stars. The lines of bar¬ 
ium, strontium, rare earths and some carbon com¬ 
pounds are strong in their spectra. Apparently nu¬ 
clear reaction products have been mixed up to the 
surface in these stars. 


est areas are the main sequence and the horizontal, red gi¬ 
ant and asymptotic branches consisting of giant stars. The 
supergiants are scattered above the giants. To the lower 
left are some white dwarfs about 10 magnitudes below the 
main sequence. The apparently brightest stars (m < 4) are 
marked with crosses and the nearest stars (r < 50 ly) with 
dots. The data are from the Hipparcos catalogue 


9.5 The Hertzsprung-Russell 
Diagram 

Around 1910, Ejnar Hertzsprung and Henry Nor¬ 
ris Russell studied the relation between the abso¬ 
lute magnitudes and the spectral types of stars. 
The diagram showing these two variables is now 
known as the Hertzsprung-Russell diagram or 
simply the HR diagram (Fig. 9.8). It has turned 
out to be an important aid in studies of stellar evo¬ 
lution. 

In view of the fact that stellar radii, luminosi¬ 
ties and surface temperatures vary widely, one 
might have expected the stars to be uniformly dis¬ 
tributed in the HR diagram. However, it is found 
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that most stars are located along a roughly diag¬ 
onal curve called the main sequence. The Sun is 
situated about the middle of the main sequence. 

The HR diagram also shows that the yellow 
and red stars (spectral types G-K-M) are clus¬ 
tered into two clearly separate groups: the main 
sequence of dwarf stars and the giants. The gi¬ 
ant stars fall into several distinct groups. The 
horizontal branch is an almost horizontal se¬ 
quence, about absolute visual magnitude zero. 
The red giant branch rises almost vertically from 
the main sequence at spectral types K and M in 
the HR diagram. Finally, the asymptotic branch 
rises from the horizontal branch and approaches 
the bright end of the red giant branch. These var¬ 
ious branches represent different phases of stellar 
evolution (cf. Sects. 12.3 and 12.4): dense areas 
correspond to evolutionary stages in which stars 
stay a long time. 

A typical horizontal branch giant is about 
a hundred times brighter than the Sun. Since gi¬ 
ants and dwarfs of the same spectral class have 
nearly the same surface temperature, the differ¬ 
ence in luminosity must be due to a difference in 
radius according to (5.21). For example Arcturus, 
which is one of the brightest stars in the sky, has 
a radius about thirty times that of the Sun. 

The brightest red giants are the supergiants 
with magnitudes up to My = — 7. One example 
is Betelgeuze. in Orion, with a radius of 400 so¬ 
lar radii and 20,000 times more luminous than the 
Sun. 

About 10 magnitudes below the main se¬ 
quence are the white dwarfs. They are quite nu¬ 
merous in space, but faint and difficult to find. 
The best-known example is Sirius B, the com¬ 
panion of Sirius. 

There are some stars in the HR diagram 
which are located below the giant branch, but 
still clearly above the main sequence. These are 
known as subgiants. Similarly, there are stars 
below the main sequence, but brighter than the 
white dwarfs, known as subdwarfs. 

When interpreting the HR diagram, one has 
to take into account selection effects : absolutely 
bright stars are more likely to be included in the 
sample, since they can be discovered at greater 
distances. If only stars within a certain distance 


from the Sun are included, the distribution of 
stars in the HR diagram looks quite different. This 
can be seen in Fig. 9.8: there are no giant or bright 
main sequence stars among these. 

The HR diagrams of star clusters are particu¬ 
larly important for the theory of stellar evolution. 
They will be discussed in Chap. 17. 


9.6 Model Atmospheres 

The stellar atmosphere consists of those layers of 
the star where the radiation that is transmitted di¬ 
rectly to the observer originates. Thus in order to 
interpret stellar spectra, one needs to be able to 
compute the structure of the atmosphere and the 
emerging radiation. 

In actual stars there are many factors, such as 
rotation and magnetic fields, that complicate the 
problem of computing the structure of the atmo¬ 
sphere. We shall only consider the classical prob¬ 
lem of finding the structure, i.e. the distribution of 
pressure and temperature with depth, in a static, 
unmagnetised atmosphere. In that case a model 
atmosphere is completely specified by giving the 
chemical composition, the gravitational acceler¬ 
ation at the surface, g , and the energy flux from 
the stellar interior, or equivalently, the effective 
temperature 7’ c . 

The basic principles involved in computing 
a model stellar atmosphere are the same as for 
stellar interiors and will be discussed in Chap. 11 . 
Essentially, there are two differential equations 
to be solved: the equation of hydrostatic equilib¬ 
rium, which fixes the distribution of pressure, and 
an equation of energy transport, which will have 
a different form depending on whether the atmo¬ 
sphere is radiative or convective, and which de¬ 
termines the temperature distribution. 

The values of the various physical quantities 
in an atmosphere are usually given as functions of 
some suitably defined continuum optical depth r. 
Thus pressure, temperature, density, ionisation 
and the population numbers of various energy 
levels can all be obtained as functions of r. When 
these are known, the intensity of radiation emerg¬ 
ing from the atmosphere can be computed. In 
Box 9.1, it is shown that approximately the emer¬ 
gent spectrum originates at unit optical depth, 
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measured along each light ray. On this basis, one 
can predict whether a given spectral line will be 
present in the spectrum. 

Consider a spectral line formed when an atom 
(or ion) in a given energy state absorbs a photon. 
From the model atmosphere, the occupation num¬ 
ber of the absorbing level is known as a function 
of the (continuum) optical depth r. If now there 
is a layer above the depth r = 1 where the ab¬ 
sorbing level has a high occupancy, the optical 
depth in the line will become unity before r = 1, 
i.e. the radiation in the line will originate higher 
in the atmosphere. Because the temperature in¬ 
creases inwards, the intensity in the line will cor¬ 
respond to a lower temperature, and the line will 
appear dark. On the other hand, if the absorbing 
level is unoccupied, the optical depth at the line 
frequency will be the same as the continuum op¬ 
tical depth. The radiation at the line frequency 
will then come from the same depth as the ad¬ 
jacent continuum, and no absorption line will be 
formed. 

The expression for the intensity derived in 
Box 9.1 also explains the phenomenon of limb 
darkening seen in the Sun (Sect. 12.2). The radia¬ 
tion that reaches us from near the edge of the solar 
disk emerges at a very oblique angle (0 near 90°), 
i.e. cos 0 is small. Thus this radiation originates at 
small values of r, and hence at low temperatures. 
In consequence, the intensity coming from near 
the edge will be lower, and the solar disk will ap¬ 
pear darker towards the limb. The amount of limb 
darkening also gives an empirical way of deter¬ 
mining the temperature distribution in the solar 
atmosphere. 

Our presentation of stellar atmospheres has 
been highly simplified. In practice, the spectrum 
is computed numerically for a range of parameter 
values. The values of T e and element abundances 
for various stars can then be found by comparing 
the observed line strengths and other spectral fea¬ 
tures with the theoretical ones. We shall not go 
into details on the procedures used. 


9.7 What Do the Observations Tell 
Us? 

To conclude this chapter, we shall give a sum¬ 
mary of the properties of stars revealed by the ob¬ 


servations. At the end of the book, there are tables 
of the brightest and of the nearest stars. 

Of the brightest stars, four have a negative 
magnitude. Some of the apparently bright stars 
are absolutely bright supergiants, others are sim¬ 
ply nearby. 

In the list of the nearest stars, the dominance of 
faint dwarf stars, already apparent in the HR dia¬ 
gram, is worth noting. Most of these belong to the 
spectral types K and M. Some nearby stars also 
have very faint companions with masses about 
that of Jupiter, i.e. planets. They have not been 
included in the table. 

Stellar spectroscopy offers an important way 
of determining fundamental stellar parameters, in 
particular mass and radius. However, the spec¬ 
tral information needs to be calibrated by means 
of direct measurements of these quantities. These 
will be considered next. 

The masses of stars can be determined in the 
case of double stars orbiting each other. (The de¬ 
tails of the method will be discussed in Chap. 9.) 
These observations have shown that the larger 
the mass of a main sequence star becomes, the 
higher on the main sequence it is located. One 
thus obtains an empirical mass-luminosity rela¬ 
tion, which can be used to estimate stellar masses 
on the basis of the spectral type. 

The observed relation between mass and lu¬ 
minosity is shown in Fig. 9.9. The luminosity is 
roughly proportional to the power 3.8 of the mass: 

L oc M 3 ' 8 . (9.1) 

The relations is only approximate. According to 
it, a ten solar mass star is about 6300 times 
brighter than the Sun, corresponding to 9.5 mag¬ 
nitudes. 

The smallest observed stellar masses are about 
1 /20 of the solar mass, corresponding to stars in 
the lower right-hand part of the HR diagram. The 
masses of white dwarfs are less than one solar 
mass. The masses of the most massive main se¬ 
quence and supergiant stars are between 10 and 
possibly even 150 Mq. 

Direct interferometric measurements of stel¬ 
lar angular diameters have been made for only 
a few dozen stars. When the distances are known, 
these immediately yield the value of the radius. In 
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eclipsing binaries, the radius can also be directly 
measured (see Sect. 10.4). Altogether, close to 
a hundred stellar radii are known from direct 
measurements. In other cases, the radius must be 
estimated from the absolute luminosity and effec¬ 
tive temperature. 

In discussing stellar radii, it is convenient to 
use a version of the HR diagram with Ig 7/ on the 
horizontal and M\, a i or \g(L/L q) on the vertical 
axis. If the value of the radius R is fixed, then 
(5.21) yields a linear relation between the bolo- 
metric magnitude and Ig 7' e . Thus lines of con¬ 
stant radius in the HR diagram are straight. Lines 
corresponding to various values of the radius are 
shown in Fig. 9.8. The smallest stars are the white 
dwarfs with radii of about one per cent of the so- 



log M/M © 

Fig. 9.9 Mass-luminosity relation. The picture is based 
on binaries with known masses. Different symbols refer 
to different kinds of binaries. (From Bohm-Vitense: In¬ 
troduction to Stellar Astrophysics, Cambridge University 
Press (1989-1992)) 


lar radius, whereas the largest supergiants have 
radii several thousand times larger than the Sun. 
Not included in the figure are the compact stars 
(neutron stars and black holes) with typical radii 
of a few tens of kilometres. 

Since the stellar radii vary so widely, so do 
the densities of stars. The density of giant stars 
may be only 10 -4 kg/m 3 , whereas the density of 
white dwarfs is about 10 9 kg/m 3 . 

The range of values for stellar effective tem¬ 
peratures and luminosities can be immediately 
seen in the HR diagram. The range of effective 
temperature is 2000-40,000 K, and that of lumi¬ 
nosity 10 _4 -10 6 Lq. 

The rotation of stars appears as a broadening 
of the spectral lines. One edge of the stellar disk 
is approaching us, the other edge is receding, and 
the radiation from the edges is Doppler shifted 
accordingly. The rotational velocity observed in 
this way is only the component along the line of 
sight. The true velocity is obtained by dividing 
with sin i , where i is the angle between the line 
of sight and the rotational axis. A star seen from 
the direction of the pole will show no rotation. 

Assuming the axes of rotation to be randomly 
oriented, the distribution of rotational velocities 
can be statistically estimated. The hottest stars 
appear to rotate faster than the cooler ones. The 
rotational velocity at the equator varies from 
200-250 km/s for O and B stars to about 20 km/s 
for spectral type G. In shell stars, the rotational 
velocity may reach 500 km/s. 

The chemical composition of the outer layers 
is deduced from the strengths of the spectral lines. 
About three-fourths of the stellar mass is hy¬ 
drogen. Helium comprises about one-fourth, and 
the abundance of other elements is very small. 
The abundance of heavy elements in young stars 
(about 2 %) is much larger than in old ones, 
where it is less than 0.02 %. 


Box 9.1 (The Intensity Emerging from a Stellar 
Atmosphere) The intensity of radiation emerg¬ 
ing from an atmosphere is given by the expres¬ 
sion (5.45), i.e. 

7*00 

7„(O,0)= / S y (r„)e- T “ sec0 sec0dr v . 

Jo 

(9-2) 
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If a model atmosphere has been computed, the 
source function S v is known. 

An approximate formula for the intensity 
can be derived as follows. Let us expand the 
source function as a Taylor series about some 
arbitrary point r*. thus 

S v = S v (t*) + (r„ - t*)S' v (t*) H- 

where the dash denotes a derivative. With this 
expression, the integral in (9.2) can be evalu¬ 
ated, yielding 

I v (0, 6) = S v (r *) + (cos 9 - r*) S' v (r*) + ■ ■ ■ . 

If we now choose r* = cos0, the second 
term will vanish. In local thermodynamic 


equilibrium the source function will be the 
Planck function B V (T). We thus obtain the 
Eddington-Barbier approximation 

I v (0, 9) = B v (T[t v — cos0]). 

According to this expression, the radiation 
emerging in a given direction originated at unit 
optical depth along that direction. 


9.8 Exercise 

Exercise 9.1 Arrange the below spectra in the 
order of decreasing temperature. 
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Quite often, two stars may appear to be close to¬ 
gether in the sky, although they are really at very 
different distances. Such chance pairs are called 
optical binary stars. However, many close pairs 
of stars really are at the same distance and form 
a physical system in which two stars are orbit¬ 
ing around each other. Less than half of all stars 
are single stars like the Sun. More than 50 % be¬ 
long to systems containing two or more members. 
In general, the multiple systems have a hierarchi¬ 
cal structure: a star and a binary orbiting around 
each other in triple systems, two binaries orbit¬ 
ing around each other in quadruple systems. Thus 
most multiple systems can be described as bina¬ 
ries with several levels. 

Binaries are classified on the basis of the 
method of their discovery. This classification has 
nothing to do with the physical properties of the 
stars. Visual binaries can be seen as two sepa¬ 
rate components, i.e. the separation between the 
stars is larger than about 0.1 arc seconds. The rel¬ 
ative position of the components changes over the 
years as they move in their orbits (Fig. 10.1). In 
astrometric binary stars only one component is 
seen, but its variable proper motion shows that 
a second invisible component must be present. 
The spectroscopic binary stars are discovered on 
the basis of their spectra. Either two sets of spec¬ 
tral lines are seen or else the Doppler shift of the 
observed lines varies periodically, indicating an 
invisible companion. The fourth class of bina¬ 
ries are the photometric binary stars or eclips¬ 
ing variables. In these systems the components 
of the pair regularly pass in front of each other, 


causing a change in the total apparent magni¬ 
tude. 

Binary stars can also be classified on the ba¬ 
sis of their mutual separation. In distant binaries 
the separation between the components is tens or 
hundreds of astronomical units and their orbital 
periods are from tens to thousands of years. In 
close binaries the separation is from about one 
au down to the radius of the stars. The orbital pe¬ 
riod ranges from a few hours to a few years. The 
components of contact binaries are so close that 
they are touching each other. 

The stars in a binary system move in an ellipti¬ 
cal orbit around the centre of mass of the system. 
In Chap. 6 it was shown that the relative orbit, too, 
is an ellipse, and thus the observations are often 
described as if one component remained station¬ 
ary and the other orbited around it. 


10.1 Visual Binaries 

We consider a visual binary, assuming initially 
that the brighter primary component is station¬ 
ary and the fainter secondary component is or¬ 
biting around it. The angular separation of the 
stars and the angular direction to the secondary 
can be directly observed. Making use of observa¬ 
tions extending over many years or decades, the 
relative orbit of the secondary can be determined. 
The first binary orbit to be determined was that of 
£ UMa in 1830 (Fig. 10.2). 

The observations of visual binaries only give 
the projection of the relative orbital ellipse on the 
plane of the sky. The shape and position of the 
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Fig. 10.1 When a visual 
binary is followed for 
a long time, the 
components can be seen to 
move with respect to each 
other. Picture of Kruger 60. 
(Yerkes Observatory) 
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Fig. 10.2 In 1830 the orbit of £ Ursae Majoris was the 
first binary orbit determined observationally 



Fig. 10.3 The components of a binary system move 
around their common centre of mass. Ai, Ai denote the 
positions of the stars at a given time A, and similarly for 
B and C 


true orbit are not known. However, they can be 
calculated if one makes use of the fact that the 
primary should be located at a focal point of the 
relative orbit. The deviation of the projected po¬ 
sition of the primary from the focus of the pro¬ 
jected relative orbit allows one to determine the 
orientation of the true orbit. 

The absolute size of the orbit can only be 
found if the distance of the binary is known. 
Knowing this, the total mass of the system can 
be calculated from Kepler’s third law. 

The masses of the individual components 
can be determined by observing the motions of 
both components relative to the centre of mass 
(Fig. 10.3). Let the semimajor axes of the or¬ 
bital ellipses of the primary and the secondary 
be a i and <72 ■ Then, according to the definition of 
the centre of mass, 

a 1 m2 

— = ( 10 . 1 ) 

«2 m 1 


where m\ and mi are the component masses. The 
semimajor axis of the relative orbit is 

a — a\ + ci 2 . (10.2) 

For example, the masses of the components of 
£ UMa have been found to be 1.3 and 1.0 solar 
masses. 


10.2 Astrometric Binary Stars 

In astrometric binaries, only the orbit of the 
brighter component about the centre of mass can 
be observed. If the mass of the visible component 
is estimated, e.g. from its luminosity, the mass of 
the invisible companion can also be estimated. 

The first astrometric binary was Sirius, which 
in the 1830’s was observed to have an undulat¬ 
ing proper motion. It was concluded that it had 
a small companion, which was visually discov¬ 
ered a few decades later (Figs. 10.4 and 15.1). 
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The companion, Sirius B, was a completely new 
type of object, a white dwarf (Sect. 15.1). 

The proper motions of nearby stars have been 
carefully studied in the search for planetary sys¬ 
tems. Although e.g. Barnard’s star may have un¬ 
seen companions, the existence of planetary sys¬ 
tems around other stars was not established by 
proper motion studies but with spectroscopic ob¬ 
servations (Sect. 21.8). 


10.3 Spectroscopic Binaries 

The spectroscopic binaries (Fig. 10.5) appear as 
single stars in even the most powerful telescopes, 
but their spectra show a regular variation. The 
first spectroscopic binary was discovered in the 
1880’s, when it was found that the spectral lines 
of f UMa or Mizar split into two at regular inter¬ 
vals. 

The Doppler shift of a spectral line is directly 
proportional to the radial velocity. Thus the sep¬ 


aration of the spectral lines is largest when one 
component is directly approaching and the other 
is receding from the observer. The period of the 
variation is the orbital period of the stars. Unfor¬ 
tunately, there is no general way of determining 
the position of the orbit in space. The observed 
velocity v is related to the true velocity vq ac¬ 
cording to 

D = i>osini, (10.3) 

where the inclination i is the angle between the 
line of sight and the normal of the orbital plane. 

Consider a binary where the components 
move in circular orbits about the centre of mass. 
Let the radii of the orbits be a\ and ai. From the 
definition of the centre of mass m\ci\ = 11120.2, 
and writing a — a\ + 02, one obtains 


m 1 +1112 


Fig. 10.4 The apparent 
paths of Sirius and its 
companion in the sky 


1950 




Fig. 10.5 Spectrum of the spectroscopic binary k Arietis. In the upper spectrum the spectral lines are single, in the 
lower one doubled. (Lick Observatory) 
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The true orbital velocity is 

27Tfll 

110,1 = 

where P is the orbital period. The observed or¬ 
bital velocity according to (10.3) is thus 


tude of the periastron can be determined. Know¬ 
ing these, the mass function or the individual 
masses can again be determined to within a fac¬ 
tor sin 3 i. 


10.4 Photometric Binary Stars 


2jta\ sin; 



Substituting (10.4), one obtains 
2 Tta ;«Tsin; 

t>i =---. 

P m i + ;«2 


(10.5) 


Solving for a and substituting it in Kepler’s third 
law, one obtains the mass function equation'. 


sin 3 i v\P 
{mi+mfr 2 jtG 


( 10 . 6 ) 


If one component in a spectroscopic binary is 
so faint that its spectral lines cannot be observed, 
only P and i>i are observed. Equation (10.6) then 
gives the value of the mass function, which is 
the expression on the left-hand side. Neither the 
masses of the components nor the total mass can 
be determined. If the spectral lines of both com¬ 
ponents can be observed, vi is also known. Then 
(10.5) gives 

hi at 

V 2 «2 

and furthermore the definition of the centre of 
mass gives 

m 2 v 2 

m i =-. 

ill 

When this is substituted in (10.6), the value of 
m 2 sin 3 ;, and correspondingly, m i sin 3 ;, can be 
determined. However, the actual masses cannot 
be found without knowing the inclination. 

The size of the binary orbit (the semimajor 
axis a) is obtained from (10.5) apart from a fac¬ 
tor sin i. In general the orbits of binary stars are 
not circular and the preceding expressions cannot 
be applied as they stand. For an eccentric orbit, 
the shape of the velocity variation departs more 
and more from a simple sine curve as the ec¬ 
centricity increases. From the shape of the veloc¬ 
ity variation, both the eccentricity and the longi¬ 


In the photometric binaries, a periodic variation 
in the total brightness is caused by the motions 
of the components in a double system. Usu¬ 
ally the photometric binaries are eclipsing vari¬ 
ables, where the brightness variations are due to 
the components passing in front of each other. 
A class of photometric binaries where there are 
no actual eclipses are the ellipsoidal variables. 
In these systems, at least one of the components 
has been distorted into an ellipsoidal shape by the 
tidal pull of the other one. At different phases of 
the orbit, the projected surface area of the dis¬ 
torted component varies. The surface temperature 
will also be lower at the ends of the tidal bulges. 
Together these factors cause a small variation in 
brightness. 

The inclination of the orbit of an eclipsing bi¬ 
nary must be very close to 90°. These are the only 
spectroscopic binaries for which the inclination is 
known and thus the masses can be uniquely deter¬ 
mined. 

The variation of the magnitude of eclips¬ 
ing variables as a function of time is called 
the lightcurve. According to the shape of the 
lightcurve, they are grouped into three main 
types: Algol, ft Lyrae and W Ursae Majoris type 
(Fig. 10.6). 

Algol Stars The Algol-type eclipsing variables 
have been named after f Persei or Algol. During 
most of the period, the lightcurve is fairly con¬ 
stant. This corresponds to phases during which 
the stars are seen separate from each other and 
the total magnitude remains constant. There are 
two different minima in the lightcurve, one of 
which, the primary minimum, is usually much 
deeper than the other one. This is due to the 
brightness difference of the stars. When the larger 
star, which is usually a cool giant, eclipses the 
smaller and hotter component, there is a deep 
minimum in the lightcurve. When the small, 
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Fig. 10.6 Typical lightcurves and schematic views of Al¬ 
gol, ft Lyrae and W Ursae Majoris type binary systems. 
The size of the Sun is shown for comparison 

bright star passes across the disk of the giant, the 
total magnitude of the system does not change 
by much. 

The shape of the minima depends on whether 
the eclipses are partial or total. In a partial eclipse 
the lightcurve is smooth, since the brightness 
changes smoothly as the depth of the eclipse 
varies. In a total eclipse there is an interval dur¬ 
ing which one component is completely invis¬ 
ible. The total brightness is then constant and 
the lightcurve has a flat bottomed minimum. The 


shape of the minima in Algol variables thus gives 
information on the inclination of the orbit. 

The duration of the minima depends on the ra¬ 
tio of the stellar radii to the size of the orbit. If the 
star is also a spectroscopic binary, the true dimen¬ 
sions of the orbit can be obtained. In that case the 
masses and the size of the orbit, and thus also the 
radii can be determined without having to know 
the distance of the system. 

ft Lyrae Stars In the /l Lyrae-type binaries, the 
total magnitude varies continuously. The stars are 
so close to each other that one of them has been 
pulled into ellipsoidal shape. Thus the brightness 
varies also outside the eclipses. The ft Lyrae vari¬ 
ables can be described as eclipsing ellipsoidal 
variables. In the ft Lyrae system itself, one star 
has overfilled its Roche lobe (see Sect. 12.6) and 
is steadily losing mass to its companion. The 
mass transfer causes additional features in the 
lightcurve. 

W UMa Stars In W UMa stars, the lightcurve 
minima are almost identical, very round and 
broad. These are close binary systems where both 
components overfill their Roche lobes, forming 
a contact binary system. 

The observed lightcurves of photometric bi¬ 
naries may contain many additional features that 
confuse the preceding classification. 

- The shape of the star may be distorted by the 
tidal force of the companion. The star may be 
ellipsoidal or fill its Roche surface, in which 
case it becomes drop-like in shape. 

- The limb darkening (Sects. 9.6 and 13.2) of the 
star may be considerable. If the radiation from 
the edges of the stellar disk is fainter than that 
from the centre, it will tend to round off the 
lightcurve. 

- In elongated stars there is gravity darkening: 
the parts most distant from the centre are cooler 
and radiate less energy. 

- There are also reflection phenomena in stars. If 
the stars are close together, they will heat the 
sides facing each other. The heated part of the 
surface will then be brighter. 

- In systems with mass transfer, the material 
falling onto one of the components will change 
the surface temperature. 













246 


10 Binary Stars and Stellar Masses 


All these additional effects cause difficulties 
in interpreting the lightcurve. Usually one com¬ 
putes a theoretical model and the corresponding 
lightcurve, which is then compared with the ob¬ 
servations. The model is varied until a satisfac¬ 
tory fit is obtained. 

So far we have been concerned solely with the 
properties of binary systems in the optical do¬ 
main. Recently many double systems that radiate 
strongly at other wavelengths have been discov¬ 
ered. Particularly interesting are the binary pul¬ 
sars, where the velocity variation can be deter¬ 
mined from radio observations. Many different 
types of binaries have also been discovered at 
X-ray wavelengths. These systems will be dis¬ 
cussed in Chap. 15. 

The binary stars are the only stars with ac¬ 
curately known masses. The masses for other 
stars are estimated from the mass-luminosity re¬ 
lation (Sect. 9.7), but that is valid only for main- 
sequence stars and has to be calibrated by means 
of binary observations. 


10.5 Examples 


Example 10.2 (The Lightcurve of a Binary) Let 
us suppose that the line of sight lies in the orbital 
plane of an Algol type binary, where both com¬ 
ponents have the same radius. The lightcurve is 
essentially as shown in the figure. The primary 
minimum occurs when the brighter component is 
eclipsed. The depth of the minima will be calcu¬ 
lated. 



If the effective temperatures of the stars are Ta 
and Tg and their radius is R. their luminosities 
are given by 

L a =4jtR 2 (jT^ L b =4jtR 2 (tT%. 

The flat part of the curve corresponds to the total 
luminosity 


Example 10.1 (The Mass of a Binary Star) The 
distance of a binary star is 10 pc and the largest 
angular separation of the components is 7" and 
the smallest is 1". The orbital period is 100 years. 
The mass of the binary is to be determined, as¬ 
suming that the orbital plane is normal to the line 
of sight. 

From the angular separation and the distance, 
the semimajor axis is 

a — 4" x 10 pc = 40 au. 

According to Kepler’s third law 

a 3 40 3 

mi + m 2 = -j^2 — Yqq2 M ® = 6A M ©- 

Let the semimajor axis of one component be ai — 
3" and for the other a 2 = 1". Now the masses of 
the components can be determined separately: 

d2 m 2 

m\a\ =m 2 ai =>■ m\ = —m 2 — — 

a\ 3 

m\ + m 2 = 6.4 =>■ m j = 1.6, m 2 — 4.8. 


7-tot — L a + L B - 

The luminosities may be expressed as absolute 
bolometric magnitudes by means of (4.13). Since 
the distance moduli of the components are the 
same, the apparent bolometric magnitude at the 
primary minimum will be 


m A - m tot = M a ~ M tot 
La 

— —2-51g ——- 

^tot 


+2.5 lg 


Ttot 

~L~a 


= 2.5 lg 


An R 2 oT^ + 4nR 2 <jTg 
4nR 2 aT^ 


= 2.5 lg 



Similarly the depth of the secondary minimum is 


m B - m t ot = 2.5 lg 




Let the effective temperatures of the stars be 
Ta = 5000 K and T B — 12,000 K. The depth of 
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the primary minimum is then 

( / 12 , 000 \ 4 \ 

~ 3.8 mag. 

The secondary minimum is 

mj -™, 0 , = 2.51g(l + ( ; |^) ) 

~ 0.03 mag. 

10.6 Exercises 

Exercise 10.1 The components of a binary 
move along circular orbits. The mutual distance 
is 1 au, and the mass of each component is 1 Mq. 
An observer in the plane of the orbit will see pe¬ 
riodic splitting of the spectral lines. What is the 


maximum separation of the components of the 
H y line. 

Exercise 10.2 A planet (mass m) is orbiting 
a star (mass M) at a distance a. The distance of 
the star from the centre of gravity of the system 
is a'. Show that 

MP 2 — a 2 (a — o'), 

where P is period in years, distances are in au’s 
and masses in solar masses. 

Exercise 10.3 The distance of Barnard’s star is 
1.83 pc and mass 0.135 Mq. It has been sug¬ 
gested that it oscillates with an amplitude of 
0.026" in 25 year periods. Assuming this oscil¬ 
lation is caused by a planet, find the mass and ra¬ 
dius of the orbit of this planet. 
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The stars are huge gas spheres, hundreds of thou¬ 
sands or millions of times more massive than the 
Earth. A star such as the Sun can go on shin¬ 
ing steadily for thousands of millions of years. 
This is shown by studies of the prehistory of 
the Earth, which indicate that the energy radi¬ 
ated by the Sun has not changed by much during 
the last four thousand million years. The equilib¬ 
rium of a star must remain stable for such peri¬ 
ods. 


11.1 Internal Equilibrium Conditions 



Mathematically the conditions for the internal 
equilibrium of a star can be expressed as four 
differential equations governing the distribution 
of mass, gas pressure and energy production and 
transport in the star. These equations will now be 
derived. 

Hydrostatic Equilibrium The force of gravity 
pulls the stellar material towards the centre. It is 
resisted by the pressure force due to the thermal 
motions of the gas molecules. The first equilib¬ 
rium condition is that these forces be in equilib¬ 
rium. 

Consider a cylindrical volume element at the 
distance r from the centre of the star (Fig. 11.1). 
Its volume is dV — dAdr, where dA is its base 
area and dr its height; its mass is dm = pdAdr, 
where p = p(r) is the gas density at the radius r. 
If the mass inside radius r is M r , the gravitational 


Fig. 11.1 In hydrostatic equilibrium the sum of the grav¬ 
itational and pressure force acting on a volume element is 
zero 


force on the volume element will be 


AF„ = -- 


GM r dm 


GMrP 

r- 


dA dr. 


where G is the gravitational constant. The minus 
sign in this expression means that the force is di¬ 
rected towards the centre of the star. If the pres¬ 
sure at the lower surface of the volume element 
is P and at its upper surface P + dP, the net force 
of pressure acting on the element is 


dF p = P dA — {P + dP) dA 
=—dP dA. 


Since the pressure decreases outwards, dP will 
be negative and the force dF p positive. The equi- 
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librium condition is that the total force acting on 
the volume element should be zero, i.e. 


0 = dF g + dP p 


GMrP 

r 2 


d A dr — dP d A 


and mass d M r at the radius r (Fig. 1 1.3). Let L, 
be the energy flux, i.e. the amount of energy pass¬ 
ing through the surface r per unit time. If e is the 
energy production coefficient, i.e. the amount of 
energy released in the star per unit time and mass, 
then 


or 


dP _ GM r p 
dr r 2 


( 11 . 1 ) 


This is the equation of hydrostatic equilibrium. 


Mass Distribution The second equation gives 
the mass contained within a given radius. Con¬ 
sider a spherical shell of thickness dr at the dis¬ 
tance r from the centre (Fig. 1 1.2). Its mass is 

d M r = 4tt r 2 p dr, 


dL r = L r _|_dr — L r — ed M r — 4jtr 2 pedr. 
Thus the energy conservation equation is 

dP f r, 

-=47 xr~pe. (11-3) 

dr 

The rate at which energy is produced depends on 
the distance to the centre. Essentially all of the 
energy radiated by the star is produced in the hot 
and dense core. In the outer layers the energy pro¬ 
duction is negligible and L r is almost constant. 


giving the mass continuity equation 


d M r 
”dT 


= 47rr 2 p. 


( 11 . 2 ) 


Energy Production The third equilibrium con¬ 
dition expresses the conservation of energy, re¬ 
quiring that any energy produced in the star has 
to be carried to the surface and radiated away. We 
again consider a spherical shell of thickness dr 


The Temperature Gradient The fourth equi¬ 
librium equation gives the temperature change as 
a function of the radius, i.e. the temperature gra¬ 
dient dT/dr. The form of the equation depends 
on how the energy is transported: by conduction, 
convection or radiation. 

In the interiors of normal stars conduction is 
very inefficient, since the electrons carrying the 
energy can only travel a short distance with¬ 
out colliding with other particles. Conduction 




Fig. 11.3 The energy flowing out of a spherical shell is 
Fig. 11.2 The mass of a thin spherical shell is the product the sum of the energy flowing into it and the energy gen- 
of its volume and its density erated within the shell 
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only becomes important in compact stars, white 
dwarfs and neutron stars, where the mean free 
path of photons is extremely short, but that of 
some electrons can be relatively large. In nor¬ 
mal stars conductive energy transport can be ne¬ 
glected. 

In radiative energy transport, photons emitted 
in hotter parts of the star are absorbed in cooler 
regions, which they heat. The star is said to be in 
radiative equilibrium, when the energy released 
in the stellar interior is carried outwards entirely 
by radiation. 

The radiative temperature gradient is related to 
the energy flux L r according to 



where a = Aa/c = 7.564 x 10“ 16 Jm- 3 K“ 4 is 
the radiation constant, c the speed of light, and 
p the density. The mass absorption coefficient k 
gives the amount of absorption per unit mass. Its 
value depends on the temperature, density and 
chemical composition. 

In order to derive ( 11 .4), we consider the equa¬ 
tion of radiative transfer (5.44). In terms of the 
variables used in the present chapter, it may be 
written 


between the flux density F r and the energy 
flux L r , one obtains (11.4). 

The derivative d T/dr is negative, since the 
temperature increases inwards. Clearly there has 
to be a temperature gradient, if energy is to be 
transported by radiation: otherwise the radiation 
field would be the same in all directions and the 
net flux F r would vanish. 

If the radiative transfer of energy becomes in¬ 
efficient, the absolute value of the radiative tem¬ 
perature gradient becomes very large. In that case 
motions are set up in the gas, which carry the 
energy outwards more efficiently than the radi¬ 
ation. In these convective motions, hot gas rises 
upwards into cooler layers, where it loses its en¬ 
ergy and sinks again. The rising and sinking gas 
elements also mix the stellar material, and the 
composition of the convective parts of a star be¬ 
comes homogeneous. Radiation and conduction, 
on the other hand, do not mix the material, since 
they move only energy, not gas. 

In order to derive the temperature gradient for 
the convective case, consider a rising bubble. As¬ 
sume that the gas moving with the bubble obeys 
the adiabatic equation of state 

TocP l ~r, (11.5) 


COS0 


d/y 

dr 


- K v ply T j V . 


where P is the pressure of the gas and y, the adi¬ 
abatic exponent 


In this equation k v is replaced with a suitable 
mean value k. The equation is then multiplied 
with cos 0 and integrated over all directions and 
frequencies. On the left hand side, I v may be ap¬ 
proximated with the Planck function B v . The fre¬ 
quency integral may then be evaluated by means 
of (5.16). On the right-hand side, the first term 
can be expressed in terms of the flux density ac¬ 
cording to (4.2) and the integral over directions of 
the second gives zero, since j v does not depend 
on 0. One thus obtains 


4?r d_ / ac^ A 
3 dr \47r ) 


= -KpFy 


Finally, using the relation 


F r = 


L r 

Anr 1 ’ 


y = C P /C v , (11.6) 


is the ratio of the specific heats in constant pres¬ 
sure and constant volume. This ratio of the spe¬ 
cific heats depends on the ionisation of the gas, 
and can be computed when the temperature, den¬ 
sity and chemical composition are known. 

Taking the derivative of (11.5) we get the ex¬ 
pression for the convective temperature gradient 


dT / 1 \ T dP 

dr \ Y) P dr 


(11.7) 


In the practical computation of stellar struc¬ 
ture, one uses either (11.4) or (11.7), depending 
on which equation gives a less steep temperature 
gradient. In the outermost layers of stars heat ex¬ 
change with the surroundings must be taken into 
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account, and (11.7) is no longer a good approxi¬ 
mation. An often used method for calculating the 
convective temperature gradient in that case is the 
mixing-length theory. The theory of convection is 
a difficult and still imperfectly understood prob¬ 
lem, which is beyond the scope of this presenta¬ 
tion. 

The convective motions set in when the radia¬ 
tive temperature gradient becomes larger in ab¬ 
solute value than the adiabatic gradient, i.e. if ei¬ 
ther the radiative gradient becomes steep or if the 
convective gradient becomes small. From ( 11 .4) 
it can be seen that a steep radiative gradient is 
expected, if either the energy flux density or the 
mass absorption coefficient becomes large. The 
convective gradient may become small, if the adi¬ 
abatic exponent approaches 1. 

Boundary Conditions In order to obtain a well- 
posed problem, some boundary conditions have 
to be prescribed for the preceding differential 
equations: 

- There are no sources of energy or mass at the 
centre inside the radius r — 0; thus Mq — 0 and 
Lq = 0 . 

- The total mass within the radius R of the star 
is fixed, Mr = M. 

- The temperature and pressure at the stellar sur¬ 
face have some determinate values, Tr and Pr. 
These will be very small compared to those in 
the interior, and thus it is usually sufficient to 
take Tr — 0 and Pr = 0. 


11.2 Physical State of the Gas 

Due to the high temperature the gas in the stars is 
almost completely ionised. The interactions be¬ 
tween individual particles are small, so that, to 
a good approximation, the gas obeys the perfect 
gas equation of state, 

P — —-— pT, (11.8) 

/X/WH 

where k is Boltzmann’s constant, /i the mean 
molecular weight in units of mu, and mu the 
mass of the hydrogen atom. 

The mean molecular weight can be approxi¬ 
mately calculated assuming complete ionisation. 
An atom with nuclear charge Z then produces 
Z + 1 free particles (the nucleus and Z electrons). 
Hydrogen gives rise to two particles per atomic 
mass unit; helium gives rise to three particles per 
four atomic mass units. For all elements heavier 
than hydrogen and helium it is usually sufficient 
to take Z + 1 to be half the atomic weight. (Exact 
values could easily be calculated, but the abun¬ 
dance of heavy elements is so small that this is 
usually not necessary.) In astrophysics the rela¬ 
tive mass fraction of hydrogen is conventionally 
denoted by X, that of helium by Y and that of all 
heavier elements by Z, so that 

X+Y + Z= 1. (11.9) 

(The Z occurring in this equation should not be 
confused with the nuclear charge, which is un¬ 
fortunately denoted by the same letter.) Thus the 
mean molecular weight will be 


In addition to these boundary conditions one 
needs an expression for the pressure, which is 
given by the equation of state as well as expres¬ 
sions for the mass absorption coefficient and the 
energy generation rate, which will be considered 
later. The solution of the basic differential equa¬ 
tions give the mass, temperature, density and en¬ 
ergy flux as functions of the radius. The stellar 
radius and luminosity can then be calculated and 
compared with the observations. 

The properties of a stellar equilibrium model 
are essentially determined once the mass and the 
chemical composition have been given. This re¬ 
sult is known as the Vogt-Russell theorem. 


T = 


1 

2X+ \Y+ \Z' 


( 11 . 10 ) 


At high temperatures the radiation pressure 
has to be added to the gas pressure described by 
the perfect gas equation. The pressure exerted by 
radiation is (see Box 1 1.2) 


Frad=^aT 4 , (11.11) 

where a is the radiation constant. Thus the total 
pressure is 


k 1 , 

P= - pT+-aT 4 . (11.12) 

pmu 3 
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The perfect gas law does not apply at very high 
densities. 

The Pauli exclusion principle states that an 
atom with several electrons cannot have more 
than one electron with all four quantum numbers 
equal. This can also be generalised to a gas con¬ 
sisting of electrons (or other fermions). A phase 
space can be used to describe the electrons. The 
phase space is a 6-dimensional space, three coor¬ 
dinates of which give the position of the particle 
and the other three coordinates the momenta in 
x , y and z directions. A volume element of the 
phase space is 

AT = AxAyAzAp x Ap y Ap z . (11.13) 

From the uncertainty principle it follows that the 
smallest meaningful volume element is of the or¬ 
der of h 3 . According to the exclusion principle 
there can be only two electrons with opposite 
spins in such a volume element. When density 
becomes high enough, all volume elements of the 
phase space will be filled up to a certain limiting 
momentum. Such matter is called degenerate. 

Electron gas begins to degenerate when the 
density is of the order 10 7 kg/m 3 . In ordinary 
stars the gas is usually nondegenerate, but in 
white dwarfs and in neutron stars, degeneracy is 
of central importance. The pressure of a degener¬ 
ate electron gas is (see Box 1 1.2) 



where m e is the electron mass and N/V the num¬ 
ber of electrons per unit volume. This equation 
may be written in terms of the density 


p = Npemn/V, 


where /r c is the mean molecular weight per free 
electron in units of /«h- An expression for /x e may 
be derived in analogy with (11.10): 


Me 


I 

x+\y+\z 


2 

X+l' 


(11.15) 


For the solar hydrogen abundance this yields 


Me = 2/(0.71 + 1) = 1.17. 


The final expression for the pressure is 



(11.16) 


This is the equation of state of a degenerate elec¬ 
tron gas. In contrast to the perfect gas law the 
pressure no longer depends on the temperature, 
only on the density and on the particle masses. 

In normal stars the degenerate gas pressure is 
negligible, but in the central parts of giant stars 
and in white dwarfs, where the density is of the 
order of 10 8 kg/m 3 , the degenerate gas pressure 
is dominant, in spite of the high temperature. 

At even higher densities the electron momenta 
become so large that their velocities approach the 
speed of light. In this case the formulas of the 
special theory of relativity have to be used. The 
pressure of a relativistic degenerate gas is 


P 


he 


JV\ 4/3 

vj 


— he 


4/3 


Me'«H/ 


(11.17) 


In the relativistic case the pressure is proportional 
to the density to the power 4/3, rather than 5/3 
as for the nonrelativistic case. The transition to 
the relativistic situation takes place roughly at the 
density 10 9 kg/m 3 . 

In general the pressure inside a star depends 
on the temperature (except for a completely de¬ 
generate gas), density and chemical composition. 
In actual stars the gas will never be totally ionised 
or completely degenerate. The pressure will then 
be given by more complicated expressions. Still 
it can be calculated for each case of interest. One 
may then write 


P = P(T,p,X,Y,Z), (11.18) 


giving the pressure as a known function of the 
temperature, density and chemical composition. 

The opacity of the gas describes how difficult 
it is for radiation to propagate through it. The 
change d I of the intensity in a distance dr can 
be expressed as 


d I = —la dr, 

where a is the opacity (Sect. 4.5). The opacity 
depends on the chemical composition, tempera- 
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ture and density of the gas. It is usually written as 
a = Kp, where p is the density of the gas and k 
the mass absorption coefficient ([«■] = m 2 /kg). 

The inverse of the opacity represents the mean 
free path of radiation in the medium, i.e. the dis¬ 
tance it can propagate without being scattered 
or absorbed. The different types of absorption 
processes (bound-bound, bound-free, free-free) 
have been described in Sect. 5.1. The opacity of 
the stellar material due to each process can be 
calculated for relevant values of temperature and 
density. 


11.3 Stellar Energy Sources 

When the equations of stellar structure were de¬ 
rived, the character of the source of stellar energy 
was left unspecified. Knowing a typical stellar lu¬ 
minosity, one can calculate how long different 
energy sources would last. For instance, normal 
chemical burning could produce energy for only 
a few thousand years. The energy released by the 
contraction of a star would last slightly longer, 
but after a few million years this energy source 
would also run out. 

Terrestrial biological and geological evidence 
shows that the solar luminosity has remained 
fairly constant for at least a few thousand mil¬ 
lion years. Since the age of the Earth is about 
5000 million years, the Sun has presumably ex¬ 
isted at least for that time. Since the solar lu¬ 
minosity is 4 x 10 26 W, it has radiated about 
6 x 10 43 J in 5 x 10 9 years. The Sun’s mass is 
2 x 10 3 ° kg; thus it must be able to produce at 
least 3 x 10 13 J/kg. 

The general conditions in the solar interior 
are known, regardless of the exact energy source. 
Thus, in Example 11.5, it will be estimated that 
the temperature at half the radius is about 5 mil¬ 
lion degrees. The central temperature must be 
about ten million kelvins, which is high enough 
for thermonuclear fusion reactions to take place. 

In fusion reactions light elements are trans¬ 
formed into heavier ones. The final reaction prod¬ 
ucts have a smaller total mass than the initial nu¬ 
clei. This mass difference is released as energy 
according to Einstein’s relation E — me 2 . Ther¬ 


monuclear reactions are commonly referred to as 
burning, although they have no relation to the 
chemical burning of ordinary fuels. 

The atomic nucleus consists of protons and 
neutrons, together referred to as nucleons. We de¬ 
fine 

m p — proton mass, 
m n = neutron mass, 

Z = nuclear charge = atomic number, 

N — neutron number, 

A = Z + N = atomic weight, 
m(Z, N ) = mass of the nucleus. 

The mass of the nucleus is smaller than the 
sum of the masses of all its nucleons. The dif¬ 
ference is called the binding energy. The binding 
energy per nucleon is 

Q = -(Zm p + Nm n -m(Z,N))c 2 . (11.19) 

r l 

It turns out that Q increases towards heavier ele¬ 
ments up to iron (Z = 26). Beyond iron the bind¬ 
ing energy again begins to decrease (Fig. 1 1.4). 

It is known that the stars consist mostly of hy¬ 
drogen. Let us consider how much energy would 
be released by the fusion of four hydrogen nu¬ 
clei into a helium nucleus. The mass of a pro¬ 
ton is 1.672 x 10 -27 kg and that of a helium nu¬ 
cleus is 6.644 x 10 -27 kg. The mass difference, 
4.6 x 10 -29 kg, corresponds to an energy differ¬ 
ence E — 4.1 x 10“ 12 J. Thus 0.7 % of the mass 
is turned into energy in the reaction, correspond¬ 
ing to an energy release of 6.4 x 10 14 J per one 
kilogram of hydrogen. This should be compared 
with our previous estimate that 3 x 10 13 J/kg is 
needed. 

Already in the 1930’s it was generally ac¬ 
cepted that stellar energy had to be produced by 
nuclear fusion. In 1938 Elans Bethe and inde¬ 
pendently Carl Friedrich von Weizsdcker put for¬ 
ward the first detailed mechanism for energy pro¬ 
duction in the stars, the carbon-nitrogen-oxygen 
(CNO) cycle. The other important energy gener- 
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Fig. 11.4 The nuclear 
binding energy per nucleon 
as a function of the atomic 
weight. Among isotopes 
with the same atomic 
weight the one with the 
largest binding energy is 
shown. The points 
correspond to nuclei with 
even proton and neutron 
numbers, the crosses to 
nuclei with odd mass 
numbers. Preston, M.A. 
(1962): Physics of the 
Nucleus (Addison-Wesley 
Publishing Company, Inc., 
Reading, Mass.) 



ation processes (the proton-proton chain and the 
triple-alpha reaction ) were not proposed until the 
1950’s. 

The Proton-Proton Chain (Fig. 1 1.5). In stars 
with masses of about that of the Sun or smaller, 
the energy is produced by the proton-proton (pp) 
chain. It consists of the following steps: 


ppl: (1) 

l H+ l H 

-* 2 H + e+ + l’e 


^-t- *H + e- 

2 H + v e , 

(2) 

2 h+ 'h 

—> 3 He + y, 

(3) 

3 He + 3 He 

-* 4 He + 2'H. 


For each reaction (3) the reactions (1) and (2) 
have to take place twice. The first reaction step 
has a very small probability, which has not been 
measured in the laboratory. At the central den¬ 
sity and temperature of the Sun, the expected 
time for a proton to collide with another one to 
form a deuteron is 10 10 years on the average. It is 
only thanks to the slowness of this reaction that 
the Sun is still shining. If it were faster, the Sun 
would have burnt out long ago. The neutrino pro¬ 
duced in the reaction (1) can escape freely from 
the star and carries away some of the energy re¬ 
leased. The positron e + is immediately annihi¬ 
lated together with an electron, giving rise to two 
gamma quanta. 


The second reaction, where a deuteron and 
a proton unite to form the helium isotope 3 He, is 
very fast compared to the preceding one. Thus the 
abundance of deuterons inside stars is very small. 

The last step in the pp chain can take three dif¬ 
ferent forms. The ppl chain shown above is the 
most probable one. In the Sun 91 % of the energy 
is produced by the ppl chain. It is also possible 
for 3 He nuclei to unite into 4 He nuclei in two ad¬ 
ditional branches of the pp chain. 


ppll: 

(3) 

3 He + 4 He 

—»■ 7 Be + y, 


(4) 

7 Be + e - 

7 Li + v e , 


(5) 

7 Li+ 1 H 

-* 4 He + 4 He, 

PPlH: 

(3) 

3 He + 4 He 

—> 7 Be + y, 


(4) 

7 Be+ *H 

-* 8 B + y, 


(5) 

8 B 

-* 8 Be + e+ + 


(6) 

8 Be 

-* 4 He + 4 He. 


The Carbon Cycle (See Fig. 11.6.) At tem¬ 
peratures below 20 million degrees the pp chain 
is the main energy production mechanism. At 
higher temperatures corresponding to stars with 
masses above 1.5 Mq, the carbon (CNO) cycle 
becomes dominant, because its reaction rate in¬ 
creases more rapidly with temperature. In the 
CNO cycle carbon, oxygen and nitrogen act as 
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Fig. 11.5 The 

proton-proton chain. In the 
ppl branch, four protons 
are transformed into one 
helium nucleus, two 
positrons, two neutrinos 
and radiation. The relative 
weights of the reactions are 
given for conditions in the 
Sun. The pp chain is the 
most important energy 
source in stars with mass 
below 1.5 Mq 



ppin branch 
0.1 % He 

very temperature sensitive 


I 


catalysts. The reaction cycle is the following: 


(1) 

^C-I^H- 

» 13 N + y, 

(2) 

13 n 

* 13 C + e+ + v e 

(3) 


* 14 N + y, 

(4) 

14 n + 3 H - 

» 15 0 + y, 

(5) 

15 0 

> 15 N+ y + v e . 

(6) 

15 N+ 1 H- 

* 12 C + 4 He. 


Reaction (4) is the slowest, and thus deter¬ 
mines the rate of the CNO cycle. At a tempera¬ 
ture of 20 million degrees the reaction time for 
the reaction (4) is a million years. 

The fraction of energy released as radiation in 
the CNO cycle is slightly smaller than in the pp 
chain, because more energy is carried away by 
neutrinos. 


The Triple Alpha Reaction As a result of the 
preceding reactions, the abundance of helium in 
the stellar interior increases. At a temperature 
above 10 8 degrees the helium can be transformed 
into carbon in the triple alpha reaction: 

(1) 4 He + 4 He o- 8 Be, 

(2) 8 Be + 4 He -* 12 C + y. 

Here 8 Be is unstable and decays into two he¬ 
lium nuclei or alpha particles in 2.6 x 10 16 sec¬ 
onds. The production of carbon thus requires the 
almost simultaneous collision of three particles. 
The reaction is often written 

3 4 He —»■ 12 C +y. 

Once helium burning has been completed, at 
higher temperatures other reactions become pos- 
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Fig. 11.6 The CNO cycle is catalysed by 12 C. It trans¬ 
forms four protons into a helium nucleus, two positrons, 
two neutrinos and radiation. It is the dominant energy 
source for stars more massive than 1.5 Mq 

sible, in which heavier elements up to iron and 
nickel are built up. Examples of such reactions 
are various alpha reactions and oxygen, carbon 
and silicon burning. 

Alpha Reactions During helium burning some 
of the carbon nuclei produced react with helium 
nuclei to form oxygen, which in turn reacts to 
form neon, etc. These reactions are fairly rare and 
thus are not important as stellar energy sources. 
Examples are 

12 C + 4 He -* 16 0+y, 

16 0 + 4 He 20 Ne + y, 

20 Ne + 4 He 24 Mg + y. 

Carbon Burning After the helium is exhausted, 
carbon burning sets in at the temperature (5-8) x 
10 10 K: 

12 C + 12 C 24 Mg + y 
23 Na + 

-* 20 Ne + 4 He 
—> 23 Mg + n 
-> 16 0 + 2 4 He. 


Oxygen Burning Oxygen is consumed at 
slightly higher temperatures in the reactions 

16 0+ 16 0-* 32 S + x 
-* 31 P+ ‘H 
-> 28 Si + 4 He 
-> 31 S + n 
-* 24 Mg + 2 4 He. 

Silicon Burning After several intermediate 
steps the burning of silicon produces nickel and 
iron. The total process may be expressed as 

28 Si + 28 Si —»■ 56 Ni + y, 

56 Ni 56 Fe + 2e+ + 2u e . 

When the temperature becomes higher than 
about 10 9 K, the energy of the photons becomes 
large enough to destroy certain nuclei. Such re¬ 
actions are called photonuclear reactions or pho¬ 
todissociations. 

The production of elements heavier than iron 
requires an input of energy, and therefore such 
elements cannot be produced by thermonuclear 
reactions. Elements heavier than iron are almost 
exclusively produced by neutron capture dur¬ 
ing the final violent stages of stellar evolution 
(Sect. 11.5). 

The rates of the reactions presented above can 
be determined by laboratory experiments or by 
theoretical calculations. Knowing them, one can 
calculate the rate at which energy is released per 
unit mass and time as a function of the density, 
temperature and chemical composition: 

e = e(T, p, X, Y, Z). (11.20) 

In reality the relative abundance of each of the 
heavier nuclei needs to be known, not just their 
total abundance Z. 


11.4 Stellar Models 

A theoretical stellar model is obtained if one 
solves the differential equations for stellar struc¬ 
ture. As we have already noted, the model is 
uniquely defined once the chemical composition 
and the mass of the star have been given. 
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Table 11.1 Properties of zero age main sequence stars. (T c = central temperature; p c = central density; M c ; = relative 
mass of convective interior; M ce = relative mass of convective envelope ) 


M[M e \ 

R [R 0 ] 

L [L 0 ] 

TA K] 

T c [10 6 K] 

Pc [kg/m 3 ] 

M ci [Ml 

M ce [M] 

30 

6.6 

140,000 

44,000 

36 

3000 

0.60 

0 

15 

4.7 

21,000 

32,000 

34 

6200 

0.39 

0 

9 

3.5 

4500 

26,000 

31 

7900 

0.26 

0 

5 

2.2 

630 

20,000 

27 

26,000 

0.22 

0 

3 

1.7 

93 

14,000 

24 

42,000 

0.18 

0 

1.5 

1.2 

5.4 

8100 

19 

95,000 

0.06 

0 

1.0 

0.87 

0.74 

5800 

14 

89,000 

0 

0.01 

0.5 

0.44 

0.038 

3900 

9.1 

78,000 

0 

0.41 


Stars just formed out of the interstellar me¬ 
dium are chemically homogeneous. When stel¬ 
lar models for homogeneous stars are plotted in 
the HR diagram, they fall along the lower edge of 
the main sequence. The theoretical sequence ob¬ 
tained in this way is called the zero age main se¬ 
quence, ZAMS. The exact position of the ZAMS 
depends on the initial chemical composition. For 
stars with an initial abundance of heavy elements 
like that in the Sun, the computed ZAMS is in 
good agreement with the observations. If the ini¬ 
tial value of Z is smaller, the theoretical ZAMS 
falls below the main sequence in the subdwarf 
region of the HR diagram. This is related to the 
classification of stars into populations I and II, 
which is discussed in Sect. 17.2. 

The theoretical models also provide an ex¬ 
planation for the mass-luminosity relation. The 
computed properties of zero age main sequence 
stars of different masses are given in Table 11.1. 
The chemical composition assumed is X = 0.71 
(hydrogen mass fraction), Y — 0.27 (helium) 
and Z = 0.02 (heavier elements), except for the 
30 Mq star, which has X = 0.70 and Y — 0.28. 
The luminosity of a one solar mass star is 0.74 Lq 
and the radius 0.87 Rq. Thus the Sun has bright¬ 
ened and expanded to some extent during its evo¬ 
lution. However, these changes are small and do 
not conflict with the evidence for a steady solar 
energy output. In addition the biological evidence 
only goes back about 3000 million years. 

The model calculations show that the central 
temperature in the smallest stars ( M ~ 0.08 Mq) 
is about 4 x 10 6 K, which is the minimum tem¬ 
perature required for the onset of thermonuclear 


reactions. In the biggest stars (M ^ 50 Mq), the 
central temperature reaches 4 x 10 7 K. 

The changes in chemical composition caused 
by the nuclear reactions can be computed, since 
the rates of the various reactions at different 
depths in the star are known. For example, the 
change A A of the hydrogen abundance in the 
time interval A t is proportional to the rate of en¬ 
ergy generation e and to At: 

AXoc-eA t. (11.21) 

The constant of proportionality is clearly the 
amount of hydrogen consumed per unit energy 
[kg/J], The value of this constant of proportion¬ 
ality is different for the pp chain and the CNO 
cycle. Therefore the contribution from each re¬ 
action chain must be calculated separately in 
(11.21). For elements that are produced by the 
nuclear reactions, the right-hand side contribution 
in (11.21) is positive. If the star is convective, the 
change in composition is obtained by taking the 
average of (11.21) over the convection zone. 


Box 11.1 (Gas Pressure and Radiation Pres¬ 
sure) Let us consider noninteracting particles 
in a rectangular box. The particles may also be 
photons. Let the sides of the box be Ax, Ay 
and A", and the number of particles, N. The 
pressure is caused by the collisions of the par¬ 
ticles with the sides of the box. When a par¬ 
ticle hits a wall perpendicular to the x axis, 
its momentum in the x direction, p x , changes 
by Ap — 2p x . The particle will return to the 
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same wall after the time At — 2Ax/v x . Thus 
the pressure exerted by the particles on the wall 
(surface area A = Ay A") is 

P = F = L Ap/At = Y. PxVx 
A A Ax Ay Az 

_ N{p x v x ) 

V 

where V = Ax Ay A" is the volume of the box 
and the angular brackets represent the aver¬ 
age value. The momentum is p x = mv x (where 
m — hv/c 2 for photons), and hence 

p = Nm{v 2 ) 

V 

Suppose the velocities of the particles are 
isotropically distributed. Then (v 2 ) = ( v 2 ) = 
(v 2 ), and thus 

(« 2 H$+($+(«?) = « 


and 

Nm{ v 2 ) 

~ 3F 

If the particles are gas molecules, the energy 
of a molecule is e = j mv 2 . The total energy 
of the gas is E — N(s) = ^Nm(v 2 ), and the 
pressure may be written 

2 F 

P — -(gas). 

3 V 

If the particles are photons, they move with the 
speed of light and their energy is s — me 2 . The 
total energy of a photon gas is thus E — N (e) = 
Nmc 2 and the pressure is 

1 E 

P — - — (radiation). 


According to (4.7), (4.4) and (5.16) the en¬ 
ergy density of blackbody radiation is 



where a — 4a /c is the radiation constant. Thus 
the radiation pressure is 


P rad = aT 4 /3. 


Box 11.2 (The Pressure of a Degenerate Gas) 
A gas where all available energy levels up to 
a limiting momentum po , known as the Fermi 
momentum, are filled is called degenerate. We 
shall determine the pressure of a completely 
degenerate electron gas. 

Let the volume of the gas be V. We con¬ 
sider electrons with momenta in the range 
[p, p + d p]. Their available phase space vol¬ 
ume is 4-Ttp 2 dpV. According to the Heisen¬ 
berg uncertainty relation the elementary vol¬ 
ume in phase space is li 3 and, according to the 
Pauli exclusion principle, this volume can con¬ 
tain two electrons with opposite spins. Thus the 
number of electrons in the momentum interval 
[ P , P + dp] is 

Aitp 2 dp V 

dN = 2U l^- 

The total number of electrons with momenta 
smaller than po is 

f 8t rV fP° , 8ttV , 

N = J dN = l^l ? dp= 3 ? 1 ’"' 

Hence the Fermi momentum po is 


P o = 


an(vf 


Nonrelativistic Gas The kinetic energy of an 
electron is e — p 2 /2m e . The total energy of the 
gas is 



4 itV 
m e h 3 



P 4 dp 


4ttV 

5m e h 2 


Po- 


Introducing the expression for the Fermi mo¬ 
mentum po, one obtains 


E 


7 T (3 \ 5/3 /! 2 / A \ 5/3 

40 \7r / m e \V/ 


The pressure of the gas was derived in 
Box 11.1: 


P 


2 E 

3 V 
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1 / 3 \ 2/3 h 2 / N\ 5/ 3 

20 \7T/ /W e \ V / 
(nonrelativistic). 

Here N/ V is the number density of electrons. 


Relativistic Gas If the density becomes so 
large that the electron kinetic energy e corre¬ 
sponding to the Fermi momentum exceeds the 
rest energy m e c , the relativistic expression for 
the electron energy has to be used. In the ex¬ 
treme relativistic case s = cp and the total en¬ 
ergy 


f 8t rcV f Po , 

E = J eiN = -^l " 

2jtcV 4 
= ^Po- 

The expression for the Fermi momentum re¬ 
mains unchanged, and hence 


E 


7T 

¥ 



The pressure of the relativistic electron gas is 
obtained from the formula derived for a photon 
gas in Box 11.1: 


P 


1 E 
3 V 




N\* 

v) 


(relativistic). 


We have obtained the nonrelativistic and ex¬ 
treme relativistic approximations to the pres¬ 
sure. In intermediate cases the exact expression 
for the electron energy, 

e — [m e c + p c ) 


has to be used. 

The preceding derivations are rigorously 
valid only at zero temperature. However, the 
densities in compact stars are so high that the 
effects of a nonzero temperature are negligible, 
and the gas may be considered completely de¬ 
generate. 


11.5 Examples 

Example 11.1 (The Gravitational Acceleration at 
the Solar Surface) The expression for the gravi¬ 
tational acceleration is 

GMq 

Using the solar mass M = 1.989 x 10 30 kg and 
radius R = 6.96 x 10 8 m, one obtains 

g = 274 m s -2 ^ 28 g 0 > 

where go = 9.81 ms -2 is the gravitational accel¬ 
eration at the surface of the Earth. 

Example 11.2 (The Average Density of the Sun) 
The volume of a sphere with radius R is 



thus the average density of the Sun is 

M 3 M , 

p= — = -- ss 1410 kgm“ 3 . 

y V 4 ttR 3 

Example 11.3 (Pressure at Half the Solar Radius) 
The pressure can be estimated from the condition 
for the hydrostatic equilibrium (11.1). Suppose 
the density is constant and equal to the average 
density ~p. Then the mass within the radius r is 



and the hydrostatic equation can be written 

d P GM r ~p 4nGp 2 r 

d¥ ~ ~ 3 ' 

This can be integrated from half the solar radius, 
r = Rq/ 2, to the surface, where the pressure van¬ 
ishes: 
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which gives 


the radiation pressure given by (1 1.1 1) is 


P = 1 -jzGT> 2 R% 

« -7T6.67 x 10“ n x 1410 2 
2 

x (6.96 x 10 8 ) 2 N/m 2 
« 10 14 Pa. 

This estimate is extremely rough, since the den¬ 
sity increases strongly inwards. 


1 4 

Pmd= 3«r 4 

= - x 7.564 x 10“ 16 x (5 x 10 6 ) 4 
3 

«2x 10 11 Pa. 

This is about a thousand times smaller than the 
gas pressure estimated in Example 11.3. Thus it 
confirms that the use of the perfect gas law in Ex¬ 
ample 11.5 was correct. 


Example 11.4 (The Mean Molecular Weight of 
the Sun) In the outer layers of the Sun the ini¬ 
tial chemical composition has not been changed 
by nuclear reactions. In this case one can use the 
values X = 0.71, Y = 0.27 and Z = 0.02. The 
mean molecular weight (11.10) is then 

1 

M “ 2 x 0.71 + 0.75 x 0.27 + 0.5 x 0.02 


Example 11.7 (The Path of a Photon from the 
Centre of a Star to Its Surface) Radiative en¬ 
ergy transport can be described as a random walk, 
where a photon is repeatedly absorbed and re¬ 
emitted in a random direction. Let the step length 
of the walk (the mean free path) be d. Consider, 
for simplicity, the random walk in a plane. After 
one step the photon is absorbed at 


& 0.61. 


* 1 =^ 00801 , yi=<7sin0i, 


When the hydrogen is completely exhausted, 
X — 0 and Y = 0.98, and hence 

1 

a =-^ 1.34. 

^ 0.75 x 0.98 + 0.5 x 0.02 

Example 11.5 (The Temperature of the Sun 
at r = Rq/2) Using the density from Exam¬ 
ple 11.2 and the pressure from Example 11.3, 
the temperature can be estimated from the per¬ 
fect gas law (11.8). Assuming the surface value 
for the mean molecular weight (Example 11 .4), 
one obtains the temperature 

T = ftmuP 
kp 

_ 0.61 x 1.67 x 10- 27 x 1.0 x 10 14 

~ 1.38 x 10 -23 x 14 1 0 

^ 5 x 10 6 K. 


where 0\ is an angle giving the direction of the 
step. After N steps the coordinates are 

N N 

x — d cos 0;, y = d sin 0;, 

1=1 1=1 


and the distance from the starting point is 


r 2 = x 2 + y 2 


= d 


- / N \ 2 / N \ 2 " 

I ^2 COS0, 1 + ( sin0, ) 


The first term in square brackets can be written 
/ N \ 2 


COS 0; 


1 


= (COS 01 + COS 02 H-b COS 6nY 


Example 11.6 (The Radiation Pressure in the 
Sun at r = Rq/2) In the previous example we 
found that the temperature is T ~ 5 x 10 6 K. Thus 


N 

= cos 2 0/ + cos 6j cos 9j. 
1 i+j 
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Since the directions (9, are randomly distributed 
and independent, 

Y, cos 6i cos Oj = 0. 
i±j 

The same result applies for the second term in 
square brackets. Thus 

N 

r 2 — d 2 ^2( cos 2 0i + sin 2 6j) = Nd 2 . 
l 



After N steps the photon is at the distance r — 
d\f~N from the starting point. Similarly, a drunk¬ 
ard taking a hundred one-metre steps at random 
will have wandered ten metres from his/her start¬ 
ing point. The same result applies in three dimen¬ 
sions. 

The time taken by a photon to reach the sur¬ 
face from the centre depends on the mean free 
path d = 1 /or = I /k/). The value of k at half 
the solar radius can be estimated from the val¬ 
ues of density and temperature obtained in Ex¬ 
amples 11.2 and 11.5. The mass absorption co¬ 
efficient in these conditions is found to be k = 
10 m 2 /kg. (We shall not enter on how it is calcu¬ 
lated.) The photon mean free path is then 

1 -4 

d = — 10 4 m. 

Kp 


This should be a reasonable estimate in most 
of the solar interior. Since the solar radius r — 
10 9 m, the number of steps needed to reach the 
surface will b e N = (r/d) 2 = 10 26 . The total path 
travelled by the photon is s — Nd = 10 22 m, and 
the time taken is t = s/c = 10 6 years; a more 
careful calculation gives t — 10 7 years. Thus it 
takes 10 million years for the energy generated 
at the centre to radiate into space. Of course the 
radiation that leaves the surface does not consist 
of the same gamma photons that were produced 
near the centre. The intervening scattering, emis¬ 
sion and absorption processes have transformed 
the radiation into visible light (as can easily be 
seen). 


11.6 Exercises 

Exercise 11.1 How many hydrogen atoms are 
there in the Sun per each helium atom? 

Exercise 11.2 (a) How many pp reactions take 
place in the Sun every second? The luminosity 
of the Sun is 3.9 x 10 26 W, the mass of a pro¬ 
ton is 1.00728 amu, and that of the a particle 
4.001514 amu (1 amu is 1.6604 x 10~ 27 kg). 

(b) How many neutrinos produced in these pp 
reactions will hit the Earth in one second? 

Exercise 11.3 The mass absorption coefficient 
of a neutrino is k = 10 -21 m 2 kg . Find the 
mean free path at the centre of the Sun. 
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In the preceding chapter we have seen how one 
can compute the evolution of a star by starting 
from a homogeneous model representing a newly 
formed system. When the chemical composition 
of the star changes with time, a new model is 
computed each time. In this chapter we shall con¬ 
sider the theoretical evolutionary paths of sys¬ 
tems with various masses and see how the com¬ 
puted evolution explains the observational data. 
The following discussion is rather qualitative, 
since the details of the theoretical calculations 
are too involved for the present book. Also, the 
different evolutionary tracks would become pro¬ 
hibitively complicated if more than the very basic 
stellar properties were included. 


12.1 Evolutionary Time Scales 


be consumed before other, more rapid evolution¬ 
ary mechanisms set in. Since 0.7 % of the rest 
mass is turned into energy in hydrogen burning, 
the nuclear time scale will be 




0.007 x 0.1 Me 2 
L 


( 12 . 1 ) 


For the Sun one obtains the nuclear time scale 
10 10 years, and thus 




M/Mq 

L/Lq 


x 10 10 a. 


( 12 . 2 ) 


This gives the nuclear time scale as a function 
of the mass M and luminosity L of a given star. 
For example, if the mass is 30 Mq, one obtains t n 
about 2 million years. The reason for the shorter 
time scale is that the stellar luminosity strongly 
increases for higher masses (Table 12.1). 


Changes in a star may take place on quite differ¬ 
ent time scales at different evolutionary phases. 
There are three important basic time scales: the 
nuclear time scale t n , the thermal time scale r t and 
the dynamical or freefall time scale fj. 

The Nuclear Time Scale The time in which 
a star radiates away all the energy that can be 
released by nuclear reactions is called the nu¬ 
clear time scale. An estimate of this time can be 
obtained if one calculates the time in which all 
available hydrogen is turned into helium. On the 
basis of theoretical considerations and evolution¬ 
ary computations it is known that only just over 
10 % of the total mass of hydrogen in the star can 


The Thermal Time Scale The time in which 
a star would radiate away all its thermal energy 
if the nuclear energy production were suddenly 
turned off is called the thermal time scale. This is 
also the time it takes for radiation from the centre 
to reach the surface. According to the virial theo¬ 
rem (6.51) the kinetic energy of the thermal mo¬ 
tion of the gas particles equals half of the poten¬ 
tial energy. Thus he thermal time scale is roughly 
may be estimated as 


k 


0.5 GM 2 /R 

Z 

( M/Mq) 2 
( R/RoKL/Lo ) 


x 2 x 10 7 a, 


(12.3) 
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Table 12.1 Stellar lifetimes (unit 10 6 years) 


Mass [M q ] 

Spectral type on the 
main sequence 

Contraction to main 
sequence 

Main sequence 

Main sequence 
to red giant 

Red giant 

30 

05 

0.02 

4.9 

0.55 

0.3 

15 

B0 

0.06 

10 

1.7 

2 

9 

B2 

0.2 

22 

0.2 

5 

5 

B5 

0.6 

68 

2 

20 

3 

A0 

3 

240 

9 

80 

1.5 

F2 

20 

2000 

280 


1.0 

G2 

50 

10,000 

680 


0.5 

M0 

200 

30,000 



0.1 

M7 

500 

10 7 




where G is the constant of gravity and R the stel¬ 
lar radius. For the Sun the thermal time scale is 
about 20 million years or 1/500 of the nuclear 
time scale. 


The Dynamical Time Scale The third and 
shortest time scale is the time it would take a star 
to collapse if the pressure supporting it against 
gravity were suddenly removed. It can be esti¬ 
mated from the time it would take for a particle 
to fall freely from the stellar surface to the cen¬ 
tre. This is half of the period given by Kepler’s 
third law, where the semimajor axis of the orbit 
corresponds to half the stellar radius R: 


2,^2)= JZ. 

2 V GM V GM 


(12.4) 


The dynamical time scale of the Sun is about half 
an hour. 

The ordering of the time scales is normally 
like that in the Sun, i.e, t& •C f t t n . 


12.2 The Contraction of Stars 

Towards the Main Sequence 

The formation and subsequent gravitational col¬ 
lapse of condensations in the interstellar medium 
will be considered in a later chapter. Here we 
shall follow the behaviour of such a protostar, 
when it is already in the process of contraction. 

When a cloud contracts, gravitational potential 
energy is released and transformed into thermal 


energy of the gas and into radiation. Initially the 
radiation can propagate freely through the mate¬ 
rial, because the density is low and the opacity 
small. Therefore most of the liberated energy is 
radiated away and the temperature does not in¬ 
crease. The contraction takes place on the dynam¬ 
ical time scale; the gas is falling freely inwards. 

The density and the pressure increase most 
rapidly near the centre of the cloud. As the den¬ 
sity increases, so does the opacity. A larger frac¬ 
tion of the released energy is then turned into 
heat, and the temperature begins to rise. This 
leads to a further increase in the pressure that is 
resisting the free fall. The contraction of the cen¬ 
tral part of the cloud slows down. The outer parts, 
however, are still falling freely. 

At this stage, the cloud may already be con¬ 
sidered a protostar. It consists mainly of hydrogen 
in molecular form. When the temperature reaches 
1800 K, the hydrogen molecules are dissociated 
into atoms. The dissociation consumes energy, 
and the rise in temperature is slowed down. The 
pressure then also grows more slowly and this in 
turn means that the rate of contraction increases. 
The same sequence of events is repeated, first 
when hydrogen is ionised at 10 4 K, and then 
when helium is ionised. When the temperature 
has reached about 10 5 K, the gas is essentially 
completely ionised. 

The contraction of a protostar only stops when 
a large fraction of the gas is fully ionised in the 
form of plasma. The star then settles into hy¬ 
drostatic equilibrium. Its further evolution takes 
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Fig. 12.1 The paths in the 
HR diagram of stars 
contracting to the main 
sequence on the thermal 
time scale. After a rapid 
dynamical collapse the 
stars settle on the Hayashi 
track and evolve towards 
the main sequence on the 
thermal time scale. 

(Models by Iben, I. (1965): 
Astrophys. J. 141, 993) 



place on the thermal time scale, i.e. much more 
slowly. The radius of the protostar has shrunk 
from its original value of about 100 au to about 
1/4 au. It will usually be located inside a larger 
gas cloud and will be accreting material from its 
surroundings. Its mass therefore grows, and the 
central temperature and density increase. 

The temperature of a star that has just reached 
equilibrium is still low and its opacity corre¬ 
spondingly large. Thus it will be convective in its 
centre. The convective energy transfer is quite ef¬ 
ficient and the surface of the protostar will there¬ 
fore be relatively bright. 

We now describe the evolution in the HR di¬ 
agram. Initially the protostar will be faint and 


cool, and it will reside at the lower far right in 
the HR diagram (outside Fig. 12.1). During the 
collapse its surface rapidly heats up and bright¬ 
ens and it moves to the upper right of Fig. 12.1. 
At the end of the collapse the star will settle at 
a point corresponding to its mass on the Hayashi 
track. The Hayashi track (Fig. 12.1) gives the lo¬ 
cation in the HR diagram of completely convec¬ 
tive stars. Stars to its right cannot be in equi¬ 
librium and will collapse on the dynamic time 
scale. 

The star will now evolve almost along the 
Hayashi track on the thermal time scale. In the 
HR diagram it moves almost vertically down¬ 
wards, its radius decreases and its luminosity 
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Fig. 12.2 Herbig -Haro 
object number 555 lies at 
the end of the “elephant's 
trunk” in Pelican Nebula in 
Cygnus. The small wings 
are shockwaves, which 
give evidence for powerful 
outflows from newly 
formed stars embedded 
within the clouds. (Photo 
University of Colorado, 
University of Hawaii and 
NOAO/AURA/NSF) 



drops (Fig. 12.1). As the temperature goes on in¬ 
creasing in its centre, the opacity diminishes and 
energy begins to be transported by radiation. The 
mass of the radiative region will gradually grow 
until hnally most of the star is radiative. By then 
the central temperature will have become so large 
that nuclear reactions begin. Previously all the 
stellar energy had been released potential energy, 
but now the nuclear reactions make a growing 
contribution and the luminosity increases. The 
stellar surface temperature will also increase and 
the star will move slightly upwards to the left 
in the HR diagram. In massive stars, this turn to 
the left occurs much earlier, because their central 
temperatures are higher and the nuclear reactions 
are initiated earlier. 

For solar mass stars, the rapid collapse of the 
protostellar cloud only lasts for a few hundred 
years. The final stage of condensation is much 
slower, lasting several tens of millions of years. 
This length of time strongly depends on the stel¬ 
lar mass because of the luminosity dependence of 
the thermal time scale. A 15 Mq star condenses to 
the main sequence in 60,000 years, whereas for 
a 0.1 Mq star, the time is hundreds of millions of 
years. 

Some of the hydrogen burning reactions start 
already at a few million degrees. For example, 


lithium, beryllium and boron burn to helium in 
the ppll and ppIII branches of the pp chain long 
before the complete chain has turned on. Be¬ 
cause the star is convective and thus well mixed 
during the early stages, even its surface material 
will have been processed in the centre. Although 
the abundances of the above-mentioned elements 
are small, they give important information on the 
central temperature. 

The beginning of the main sequence phase is 
marked by the start of hydrogen burning in the pp 
chain at a temperature of about 4 million degrees. 
The new form of energy production completely 
supersedes the energy release due to contraction. 
As the contraction is halted, the star makes a few 
oscillations in the HR diagram, but soon settles in 
an equilibrium and the long, quiet main sequence 
phase begins. 

It is difficult to observe stars during contrac¬ 
tion, because the new-born stars are usually hid¬ 
den among dense clouds of dust and gas. How¬ 
ever, some condensations in interstellar clouds 
have been discovered and near them, very young 
stars. One example are the TTauri stars. Their 
lithium abundance is relatively high, which indi¬ 
cates that they are newly formed stars in which 
the central temperature has not yet become large 
enough to destroy lithium. Near the TTauri stars, 
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small, bright, star-like nebulae, Herbig-Haro ob¬ 
jects, have been discovered. These are thought to 
be produced in the interaction between a stellar 
wind and the surrounding interstellar medium. 


12.3 The Main Sequence Phase 

The main sequence phase is that evolutionary 
stage in which the energy released by the burn¬ 
ing of hydrogen in the core is the only source 
of stellar energy. During this stage, the star is in 
stable equilibrium, and its structure changes only 
because its chemical composition is gradually al¬ 
tered by the nuclear reactions. Thus the evolu¬ 
tion takes place on a nuclear time scale, which 
means that the main sequence phase is the longest 
part of the life of a star. For example, for a so¬ 
lar mass star, the main sequence phase lasts for 
about 10,000 million years. More massive stars 
evolve more rapidly, because they radiate much 
more power. Thus the main sequence phase of 
a 15 solar mass star is only about 10 million 
years. On the other hand, less massive stars have 
a longer main sequence lifetime: a 0.25 Mq star 
spends about 70,000 million years on the main 
sequence. 

Since stars are most likely to be found in the 
stage of steady hydrogen burning, the main se¬ 
quence in the HR diagram is richly populated, in 
particular at its low-mass end. The more massive 
upper main sequence stars are less abundant be¬ 
cause of their shorter main sequence lifetimes. 

If the mass of a star becomes too large, the 
force of gravity can no longer resist the radiation 
pressure. Stars more massive than this upper limit 
cannot form, because they cannot accrete addi¬ 
tional mass during the contraction phase. Theo¬ 
retical computations give a limiting mass of about 
120 M 0 ; the most massive stars observed are 
claimed to be about 150 Mq. These values are, 
however, uncertain. 

There is also a lower-mass limit of the main 
sequence. Stars below 0.08 Mq never become hot 
enough for hydrogen burning to begin. They can 
still generate some luminosity from the burning 
of deuterium, but this energy source is rapidly ex¬ 
hausted. These brown dwarfs have surface tem¬ 
peratures in the range of 1000-2000 K. Hundreds 


of brown dwarfs have now been found in dedi¬ 
cated surveys. The lower limit for brown dwarf 
mass is sometimes taken to be about 0.015 Mq, 
corresponding to the minimum mass for deu¬ 
terium burning. 

If the mass is even lower there are no nu¬ 
clear sources of energy. The smallest protostars 
therefore contract to planet-like dwarfs. During 
the contraction phase they radiate because poten¬ 
tial energy is released, but eventually they be¬ 
gin to cool. In the HR diagram such stars first 
move almost vertically downwards and then fur¬ 
ther downwards to the right. 

Is there a difference between the lowest-mass 
brown dwarfs and the most massive planets? If 
brown dwarfs have formed by gravitational col¬ 
lapse and fragmentation as described in the pre¬ 
vious section and in Sect. 15.4, there is no rea¬ 
son not to count them as stars, although they are 
not producing energy by nuclear reactions. Plan¬ 
ets in contrast are thought to form much more 
slowly by the clumping of solids and accretion of 
gas in a protoplanetary disk. The objects formed 
by this mechanism start out with a quite differ¬ 
ent structure. Whether such a clear-cut distinction 
between the formation mechanisms of dark stars 
and planets really can be made still remains an 
open question. 

The Upper Main Sequence The stars on the 
upper main sequence are so massive and their 
central temperature so high that the CNO cycle 
can operate. On the lower main sequence the en¬ 
ergy is produced by the pp chain. The pp chain 
and the CNO cycle are equally efficient at a tem¬ 
perature of 18 million degrees, corresponding to 
the central temperature of a 1.5 Mq star. The 
boundary between the upper and the lower main 
sequence corresponds roughly to this mass. 

The energy production in the CNO cycle is 
very strongly concentrated at the core. The out¬ 
ward energy flux will then become very large, 
and can no longer be maintained by radiative 
transport. Thus the upper main sequence stars 
have a convective core, i.e. the energy is trans¬ 
ported by material motions. These keep the mate¬ 
rial well mixed, and thus the hydrogen abundance 
decreases uniformly with time within the entire 
convective region. 
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Fig. 12.3 Stellar 
evolutionary paths in the 
HR diagram at the main 
sequence phase and later. 
On the main sequence, 
bounded by dashed curves, 
the evolution is on the 
nuclear time scale. The 
post-main sequence 
evolution to the red giant 
phase is on the thermal 
time scale. The point 
marked He corresponds to 
helium ignition and in 
low-mass stars the helium 
flash. The straight line 
shows the location of stars 
with the same radius. 

(Iben, I. (1967): Annual 
Rev. Astron. Astrophys. 5, 
571; data for 30 Mq from 
Stothers, R. (1966): 
Astrophys. J. 143, 91) 



Outside the core, there is radiative equilib¬ 
rium, i.e. the energy is carried by radiation and 
there are no nuclear reactions. Between the core 
and the envelope, there is a transition region 
where the hydrogen abundance decreases in¬ 
wards. 

The mass of the convective core will gradu¬ 
ally diminish as the hydrogen is consumed. In the 
HR diagram the star will slowly shift to the upper 
right as its luminosity grows and its surface tem¬ 
perature decreases (Fig. 12.2). When the central 
hydrogen supply becomes exhausted, the core of 
the star will begin to shrink rapidly. The surface 
temperature will increase and the star will quickly 
move to the upper left. Because of the contraction 


of the core, the temperature in the hydrogen shell 
just outside the core will increase. It rapidly be¬ 
comes high enough for hydrogen burning to set 
in again. 

The Lower Main Sequence On the lower main 
sequence, the central temperature is lower than 
for massive stars, and the energy is generated by 
the pp chain. Since the rate of the pp chain is not 
as sensitive to temperature as that of the CNO cy¬ 
cle, the energy production is spread over a larger 
region than in the more massive stars (Fig. 12.3). 
In consequence, the core never becomes convec- 
tively unstable, but remains radiative. 
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In the outer layers of lower main sequence 
stars, the opacity is high because of the low tem¬ 
perature. Radiation can then no longer carry all 
the energy, and convection will set in. The struc¬ 
ture of lower main sequence stars is thus oppo¬ 
site to that of the upper main sequence: the centre 
is radiative and the envelope is convective. Since 
there is no mixing of material in the core, the hy¬ 
drogen is most rapidly consumed at the very cen¬ 
tre, and the hydrogen abundance increases out¬ 
wards. 

As the amount of hydrogen in the core de¬ 
creases, the star will slowly move upwards in 
the HR diagram, almost along the main sequence 
(Fig. 12.2). It becomes slightly brighter and hot¬ 
ter, but its radius will not change by much. The 
evolutionary track of the star will then bend to the 
right, as hydrogen in the core nears its end. Even¬ 
tually the core is almost pure helium. Hydrogen 
will continue to burn in a thick shell around the 
core. 

Stars with masses between 0.08 Mq and 
0.26 Mq have a very simple evolution. During 
their whole main sequence phase they are fully 
convective, which means that their entire hydro¬ 
gen content is available as fuel. These stars evolve 
very slowly toward the upper left in the HR dia¬ 
gram. Finally, when all their hydrogen has burned 
to helium, they contract to become white dwarfs. 


12.4 The Giant Phase 

The main-sequence phase of stellar evolution 
ends when hydrogen is exhausted at the centre. 
The star then settles in a state in which hydrogen 
is burning in a shell surrounding a helium core. 
As we have seen, the transition takes place grad¬ 
ually in lower main-sequence stars, giving rise to 
the Subgiant Branch in the HR diagram, while 
the upper main-sequence stars make a rapid jump 
at this point. 

The mass of the helium core is increased by 
the hydrogen burning in the shell. This leads to 
the expansion of the envelope of the star, which 
moves almost horizontally to the right in the HR 
diagram. As the convective envelope becomes 
more extensive, the star approaches the Hayashi 
track. Since it cannot pass further to the right, and 


since its radius continues to grow, the star has to 
move upwards along the Hayashi track towards 
larger luminosities (Fig. 12.2). The star has be¬ 
come a red giant. 

In low-mass stars (M < 2.3 Mq), as the mass 
of the core grows, its density will eventually be¬ 
come so high that it becomes degenerate. The 
central temperature will continue to rise. The 
whole helium core will have a uniform temper¬ 
ature because of the high conductivity of the de¬ 
generate gas. If the mass of the star is larger than 
0.26 Mq the central temperature will eventually 
reach about 100 million degrees, which is enough 
for helium to burn to carbon in the triple alpha 
process. 

Helium burning will set in simultaneously in 
the whole central region and will suddenly raise 
its temperature. Unlike a normal gas, the degener¬ 
ate core cannot expand, although the temperature 
increases (cf. (11.16)), and therefore the increase 
in temperature will only lead to a further accel¬ 
eration of the rate of the nuclear reactions. When 
the temperature increases further, the degeneracy 
of the gas is removed and the core will begin to 
expand violently. Only a few seconds after the ig¬ 
nition of helium, there is an explosion, the helium 
flash. 

The energy from the helium flash is absorbed 
by the outer layers, and thus it does not lead to 
the complete disruption of the star. In fact the lu¬ 
minosity of the star drops in the flash, because 
when the centre expands, the outer layers con¬ 
tract. The energy released in the flash is turned 
into potential energy of the expanded core. Thus 
after the helium flash, the star settles into a new 
state, where helium is steadily burning to carbon 
in a nondegenerate core. 

After the helium flash the star finds itself on 
the horizontal giant branch in the HR diagram. 
The exact position of a star on the horizontal 
branch after the helium flash is a sensitive func¬ 
tion of its envelope mass. This in turn depends 
on the amount of mass lost by the star in the he¬ 
lium flash, which can vary randomly from star 
to star. While the luminosity does not vary much 
along the horizontal branch, the effective temper¬ 
atures are higher for stars with less mass in the 
envelope. The horizontal branch is divided into 
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a blue and a red part by a gap corresponding to 
the pulsational instability leading to RR Lyrae 
variables (see Sect. 14.2). The form of the hori¬ 
zontal branch for a collection of stars depends on 
their metal-abundance, in the sense that a lower 
metal abundance is related to a more prominent 
blue horizontal branch. Thus the blue horizon¬ 
tal branch in globular clusters with low metal- 
abundances is strong and prominent (Sect. 16.3). 
For stars with solar element abundances the hori¬ 
zontal branch is reduced to a short stump, the red 
clump , where it joins the red giant branch. 

In intermediate-mass stars (2.3 M© < M < 
8 Mq), the central temperature is higher and the 
central density lower, and the core will therefore 
not be degenerate. Thus helium burning can set in 
non-catastrophically as the central regions con¬ 
tract. As the importance of the helium burning 
core increases, the star first moves away from 
the red giant branch towards bluer colours, but 
then loops back towards the Hayashi track again. 
An important consequence of these blue loops 
is that they bring the star into the strip in the 
HR diagram corresponding to the cepheid insta¬ 
bility (Sect. 14.2). This gives rise to the classi¬ 
cal cepheid variables, which are of central impor¬ 
tance for determining distances in the Milky Way 
and to the nearest galaxies. 

In the most massive stars helium burning starts 
before the star has had time to reach the red giant 
branch. Some stars will continue moving to the 
right in the HR diagram. For others this will pro¬ 
duce a massive stellar wind and a large mass loss. 
Stars in this evolutionary phase, such as P Cygni 
and r] Carinae, are known as luminous blue vari¬ 
ables, LBV, and are among the brightest in the 
Milky Way. If the star can retain its envelope it 
will become a red supergiant. Otherwise it will 
turn back towards the blue side of the HR dia¬ 
gram, ending up as a Wolf-Rayet star. 

The Asymptotic Giant Branch The evolu¬ 
tion that follows core helium burning depends 
strongly on the stellar mass. The mass determines 
how high the central temperature can become and 
the degree of degeneracy when heavier nuclear 
fuels are ignited. 

When the central helium supply is exhausted, 
helium will continue to burn in a shell, while 


the hydrogen burning shell is extinguished. In the 
HR diagram the star will move towards lower ef¬ 
fective temperature and higher luminosity. This 
phase is quite similar to the previous red giant 
phase of low-mass stars, although the temper¬ 
atures are slightly hotter. For this reason it is 
known as the asymptotic giant branch, AGB. 

After the early phase, when the helium shell 
catches up with the extinguished hydrogen shell, 
the AGB star enters what is known as the ther¬ 
mally pulsing phase, where hydrogen and helium 
shell burning alternate. A configuration with two 
burning shells is unstable, and in this phase the 
stellar material may become mixed or matter may 
be ejected into space in a shell, like that of a plan¬ 
etary nebula. 

The thermally pulsing AGB continues until ra¬ 
diation pressure has led to the complete expul¬ 
sion of the outer layers into a planetary nebula. 
Low- and intermediate-mass giants (M < 8 Mq) 
never become hot enough to ignite carbon burn¬ 
ing in the core, which remains as a carbon- 
oxygen white dwarf (Fig. 12.6). 

The End of the Giant Phase After the end of 
helium burning the evolution of a star changes 
character. This is because the nuclear time scale 
at the centre becomes short compared to the ther¬ 
mal time scale of the outer layers. Secondly, 
the energy released in nuclear reactions will be 
carried away by neutrinos, instead of being de¬ 
posited at the centre. In consequence, while the 
thermonuclear burning follows the same pattern 
as hydrogen and helium burning, the star as a 
whole does not have time to react immediately. 

In stars with masses around 10 Mq either car¬ 
bon or oxygen may be ignited explosively just 
like helium in low-mass stars: there is a carbon 
or oxygen flash. This is much more powerful than 
the helium flash, and may make the star explode 
as a supernova (Sects. 12.5 and 13.3). 

For even larger masses the core remains non¬ 
degenerate and burning will start non-catastro- 
phically as the core goes on contracting and 
becoming hotter. First carbon burning and sub¬ 
sequently oxygen and silicon burning (see 
Sect. 10.3) will be ignited. As each nuclear fuel 
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Fig. 12.4 Energy transport in the main sequence phase, 
(a) The least massive stars (M < 0.26 Mq) are convective 
throughout, (b) For 0.26 Mq < M < 1.5 Mq the core is 
radiative and the envelope convective, (c) Massive stars 
(M >1.5 Mq) have a convective core and a radiative en¬ 
velope 

is exhausted in the centre, the burning will con¬ 
tinue in a shell. The star will thus contain sev¬ 
eral nuclear burning shells. At the end the star 
will consist of a sequence of layers differing in 
composition, in massive stars (more massive than 
15 Mq) all the way up to iron. 

The central parts of the most massive stars 
with masses larger than 15 Mq burn all the way 
to iron 56 Fe. All nuclear sources of energy will 
then be completely exhausted. The structure of 
a 30 solar mass star at this stage is schemati¬ 
cally shown in Fig. 12.4. The star is made up 
of a nested sequence of zones bounded by shells 
burning silicon 28 Si, oxygen 16 0 and carbon l2 C, 
helium 4 He and hydrogen 1 H. However, this is 
not a stable state, since the end of nuclear re¬ 
actions in the core means that the central pres¬ 
sure will fall, and the core will collapse. Some 
of the energy released in the collapse goes into 
dissociating the iron nuclei first to helium and 
then to protons and neutrons. This will further 
speed up the collapse, just like the dissociation 
of molecules speeds up the collapse of a pro¬ 
tostar. The collapse takes place on a dynamical 
time scale, which, in the dense stellar core, is 
only a fraction of a second. The outer parts will 
also collapse, but more slowly. In consequence, 
the temperature will increase in layers contain¬ 
ing unburnt nuclear fuel. This will burn explo¬ 
sively, releasing immense amounts of energy in 
a few seconds, principally in the form of neutri¬ 
nos. 

The final stages of stellar evolution may be 
described as an implosion of the core, which is 


briefly halted every time a new source of nu¬ 
clear fuel becomes available for burning It is still 
an open problem how exactly the energy released 
in this collapse is transformed into the disrup¬ 
tion of the entire star and the ejection of its outer 
layers. It is also still unclear whether in a given 
case the remnant will be a neutron star or a black 
hole. 

Although the exact mechanism is not yet un¬ 
derstood, the end-point of the evolution of stars 
more massive the about 8Mq is that the outer 
layers explode as a supernova. In the dense cen¬ 
tral core, the protons and electrons combine to 
form neutrons. The core will finally consist al¬ 
most entirely of neutrons, which become degen¬ 
erate because of the high density. The degeneracy 
pressure of the neutrons will stop the collapse of 
a small mass core. However, if the mass of the 
core is large enough, a black hole will probably 
be formed. 


12.5 The Final Stages of Evolution 

The endpoints of stellar evolution can be seen 
from Fig. 12.7. This shows the relation between 
mass and central density for a body at zero tem¬ 
perature, i.e. the final equilibrium when a mas¬ 
sive body has cooled. There are two maxima 
on the curve. The mass corresponding to the 
left-hand maximum is called the Chandrasekhar 
mass, Mch ~ 1.2-1.4 Mq, and that correspond¬ 
ing to the right-hand one, the Oppenheimer- 
Volkoff mass, Mov ~ 1.5-2 Mq. 

Let us first consider a star with mass less than 
Mch- Suppose the mass does not change. When 
the nuclear fuel is exhausted, the star will become 
a white dwarf, which will gradually cool down 
and contract. In Fig. 12.5 it moves horizontally 
to the right. Finally it will reach zero tempera¬ 
ture and end up on the left-hand rising part of the 
equilibrium curve. Its final equilibrium is a com¬ 
pletely degenerate black dwarf. 

If the mass of the star is larger than Mch but 
smaller than Mov, it can continue cooling until it 
reaches the right-hand rising section of the curve. 
Again there is a stable final state, this time corre¬ 
sponding to a completely degenerate neutron star. 

An even more massive star with mass larger 
than Mqv will go on contracting past the den- 
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sity corresponding to a neutron star. There is 
then no longer any known possible stable equi¬ 
librium, and the star must go on contracting to 
form a black hole. 

The only endpoints of stellar evolution pre¬ 
dicted by theory are the two stable states of 
Fig. 12.7 and the two extreme possibilities, col¬ 
lapse to a black hole or explosive disruption. 



Fig. 12.5 The structure of a massive star (30 Mg) at a late 
evolutionary stage. The star consists of layers with differ¬ 
ent composition separated by nuclear burning shells 


The preceding considerations are purely theo¬ 
retical. The final evolutionary stages of real stars 
involve many imperfectly known factors, which 
may affect the final equilibrium. Perhaps most 
important is the question of mass loss, which is 
very difficult to settle either observationally or 
theoretically. For example, in a supernova explo¬ 
sion the whole star may be disrupted and it is very 
uncertain whether what remains will be a neutron 
star, a black hole or nothing at all. (The structure 
of compact stars will be discussed in Chap. 15.) 

A summary of the various evolutionary paths 
is given in Fig. 12.8. 


12.6 The Evolution of Close Binary 
Stars 

If the components of a binary star are well sep¬ 
arated, they do not significantly perturb one an¬ 
other. When studying their evolution, one can re¬ 
gard them as two single stars evolving indepen¬ 
dently, as described above. However, in close bi¬ 
nary pairs, this will no longer be the case. 

Close binary stars are divided into three 
classes, as shown in Fig. 12.9: detached , semide¬ 
tached and contact binaries. The figure-eight 



Fig. 12.6 The usual endpoint for the development of scope. For a massive star, the life ends with a super- 

a star with a mass of less than three solar masses, nova explosion. On the right , the supernova remnant Cas- 

is a white dwarf, with an expanding planetary neb- siopeia A on radio wavelengths. The image was created 

ula around it. On the left, the planetary nebula NGC by the VLA telescope. (Images Gemini Observatory/Abu 

6369, photographed with the 8-meter Gemini South tele- Team/NOAO/AURA/NSF and NRAO/AUI) 
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Fig. 12.7 The evolutionary end points of stars with dif¬ 
ferent masses shown as a function of central density. The 
curve shows the behaviour of the central density of com¬ 
pletely degenerate (T = 0 K) bodies. The Chandrasekhar 
mass Mch and the Oppenheimer-Volkoff mass Mov cor¬ 
respond to maxima on this curve 

curve drawn in the figure is an equipotential sur¬ 
face called the Roche surface. If the star becomes 
larger than this surface, it begins to lose mass to 
its companion through the waist of the Roche sur¬ 
face. 

During the main sequence phase the stellar ra¬ 
dius does not change much, and each component 
will remain within its own Roche lobe. When 
the hydrogen is exhausted, the stellar core will 
rapidly shrink and the outer layers expand, as we 
have seen. At this stage a star may exceed its 
Roche lobe and mass transfer may set in. 

Close binary stars are usually seen as eclipsing 
binaries. One example is Algol in the constella¬ 
tion Perseus. The components in this binary sys¬ 
tem are a normal main sequence star and a sub¬ 
giant, which is much less massive than the main 
sequence star. The subgiant has a high luminos¬ 
ity and thus has apparently already left the main 
sequence. This is unexpected, since the compo¬ 
nents were presumably formed at the same time, 
and the more massive star should evolve more 
rapidly. The situation is known as the Algol para¬ 
dox: for some reason, the less massive star has 
evolved more rapidly. 

In the 1950’s a solution to the paradox pro¬ 
posed that the subgiant was originally more mas¬ 
sive, but that it had lost mass to its companion 
during its evolution. Since the 1960’s mass trans¬ 
fer in close binary systems has been much stud¬ 


ied, and has turned out be a very significant factor 
in the evolution of close binaries. 

As an example, let us consider a close binary, 
where the initial masses of the components are 
1 and 2 solar masses and the initial orbital pe¬ 
riod 1.4 days (Fig. 12.10). After evolving away 
from the main sequence the more massive com¬ 
ponent will exceed the Roche limit and begin to 
lose mass to its companion. Initially the mass will 
be transferred on the thermal time scale, and after 
a few million years the roles of the components 
will be changed: the initially more massive com¬ 
ponent has become less massive than its compan¬ 
ion. 

The binary is now semidetached and can be 
observed as an Algol-type eclipsing binary. The 
two components are a more massive main se¬ 
quence star and a less massive subgiant filling its 
Roche surface. The mass transfer will continue, 
but on the much slower nuclear time scale. Fi¬ 
nally, mass transfer will cease and the less mas¬ 
sive component will contract to a 0.6 Mq white 
dwarf. 

The more massive 2.4 Mq star now evolves 
and begins to lose mass, which will accumulate 
on the surface of the white dwarf. The accumu¬ 
lated mass may give rise to nova outbursts, where 
material is ejected into space by large explosions. 
Despite this, the mass of the white dwarf will 
gradually grow and may eventually exceed the 
Chandrasekhar mass. The white dwarf will then 
collapse and explode as a type I supernova. 

As a second example, we can take a massive 
binary with the initial masses 20 and 8 Mq and 
the initial period 4.7 days (Fig. 12.1 1). The more 
massive component evolves rapidly, and at the 
end of the main sequence phase, it will transfer 
more than 15 Mq of its material to the secondary. 
The mass transfer will occur on the thermal time 
scale, which, in this case, is only a few ten thou¬ 
sand years. The end result is a helium star, having 
as a companion an unevolved main sequence star. 
The properties of the helium star are like those of 
a Wolf-Rayet star (Fig. 12.12). 

Helium continues to burn to carbon in the core 
of the helium star, and the mass of the carbon core 
will grow. Eventually the carbon will be explo¬ 
sively ignited, and the star will explode as a su- 
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Fig. 12.8 Evolution schemes for stars with different 
masses. The radius is scaled to be the same in all draw¬ 
ings. In reality, there are vast differences in the sizes of 
different stars and different phases of evolution. In the be¬ 
ginning (i) a gas cloud is contracting rapidly in free fall. 
Because the gas is quite rarefied, radiation escapes easily 
from the cloud. As the density increases, radiation trans¬ 
port becomes more difficult, and the released energy tends 
to warm up the gas. The contraction lasts until the gas is 
completely ionised, and the star, which has become a pro¬ 
tostar, is in hydrostatic equilibrium (2). The star is convec¬ 
tive throughout its interior. 

Now evolution continues on a thermal time scale. The con¬ 
traction is much slower than in the free-fall phase. The 
phases of further evolution are determined by the mass M 
of the star. For M < 0.08 Mq the temperature in the cen¬ 
tre does not rise high enough for hydrogen burning, and 
these stars contract to planetlike brown dwarfs. Stars with 
M > 0.08 Mq start hydrogen burning when the tempera¬ 
ture has reached about 4 x 10 6 K. This is the beginning 
of the main sequence phase. In the main sequence, the 
lowest-mass stars with 0.08 Mq < M < 0.26 Mq are en¬ 
tirely convective, and thus they remain homogeneous (3). 
Their evolution is very slow, and after all the hydrogen has 
been burnt to helium, they contract to white dwarfs (4). 
The increasing temperature makes the stars with M > 
0.26 Mq radiative in the centre as the opacity de¬ 
creases (5). The low-mass stars with 0.26 Mq < M < 
1.5 Mq remain radiative in the centre during the main 
sequence phase (6) as they burn their hydrogen through 
the pp chain. The outer part is convective. At the end of 
the main sequence phase, hydrogen burning continues in 
a shell surrounding the helium core (7). 


The outer part expands, and the giant phase begins. The 
contracting helium core is degenerate and warms up. At 
about 10 s K, the triple alpha process begins and leads im¬ 
mediately to the helium flash ( 8 ). The explosion is damped 
by the outer parts, and helium burning goes on in the 
core (9). Hydrogen is still burning in an outer shell. As the 
central helium is exhausted, helium burning moves over to 
a shell (10). At the same time, the outer part expands and 
the star loses some of its mass. The expanding envelope 
forms a planetary nebula (11). The star in the centre of the 
nebula becomes a white dwarf (12). 

In the upper main sequence with M > 1.5 Mq energy is 
released through the CNO cycle, and the core becomes 
convective, while the outer part is radiative (13). The main 
sequence phase ends as the hydrogen in the core is ex¬ 
hausted, and shell burning begins (14). The helium core 
remains convective and nondegenerate, and helium burn¬ 
ing begins without perturbations (15 and 19). Afterwards, 
helium burning moves over to a shell (16 and 20). For 
stars with 3 Mq < M < 15 Mq the carbon in the core is 
degenerate, and a carbon flash occurs (17). This leads to 
a supernova explosion (18) and possibly to the complete 
destruction of the star. 

For the most massive stars with M > 15 Mq the carbon 
core remains convective, and carbon burns to oxygen and 
magnesium. Finally, the star consists of an iron core sur¬ 
rounded by shells with silicon, oxygen, carbon, helium 
and hydrogen (21). The nuclear fuel is now exhausted, and 
the star collapses on a dynamical time scale. The result is 
a supernova (22). The outer parts explode, but the remain¬ 
ing core continues to contract to a neutron star or a black 
hole 
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Fig. 12.8 (Continued) 



Fig. 12.9 The types of close binary systems: (a) de¬ 
tached, (b) semidetached and (c) contact binary 

pernova. The consequences of this explosion are 
not known, but let us suppose that a 2 Mq com¬ 
pact remnant is left. As the more massive star ex¬ 
pands, its stellar wind will become stronger, giv¬ 
ing rise to strong X-ray emission as it hits the 
compact star. This X-ray emission will only cease 
when the more massive star exceeds its Roche 
surface. 


The system will now rapidly lose mass and 
angular momentum. A steady state is finally 
reached when the system contains a 6 Mq helium 
star in addition to the 2 Mq compact star. The 
helium star is seen as a Wolf-Rayet star, which, 
after about a million years, explodes as a super¬ 
nova. This will probably lead to the breakup of 
the binary system. However, for certain values 
of the mass, the binary may remain bound. Thus 
a binary neutron star may be formed. 


12.7 Comparison with Observations 

The most important direct support for the theo¬ 
retical evolutionary models is obtained from the 
properties of observed HR diagrams. If the the¬ 
oretical models are correct, the observed number 
of stars should reflect the duration of the various 
evolutionary phases. These are given for stars of 
different masses in Table 12.1. The stars are most 
numerous along the main sequence. Giants are 
also common and, in addition to these, there are 
white dwarfs, subgiants, etc. The sparsely popu¬ 
lated region to the right of the main sequence, the 
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Hertzsprung gap , is explained by the rapid tran¬ 
sition from the main sequence to the giant phase. 

The cepheids provide an important test for 
the evolutionary models. The pulsations and the 
relation between period and luminosity for the 
cepheids can be understood on the basis of the¬ 
oretical stellar models. 

The evolutionary models can also explain the 
HR diagrams of star clusters. Let us assume that 
all the stars in a cluster were formed at the same 
time. In the youngest systems, the associations, 
the stars will mainly be found on the upper main 



Fig. 12.10 Evolution of a low-mass binary: (a) both 
components on the main sequence; (b) mass transfer from 
the more massive component; (c) light subgiant and mas¬ 
sive main sequence star; (d) white dwarf and main se¬ 
quence star; (e) mass transferred to the white dwarf from 
the more massive component leads to nova outbursts; 
(f) the white dwarf mass exceeds the Chandrasekhar mass 
and explodes as a type I supernova 


sequence, since the most massive stars evolve 
most rapidly. To the right of the main sequence, 
there will be less massive T Tauri stars, which are 
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Fig. 12.11 Evolution of a massive binary. It has been 
assumed that the supernova explosion of a 5 Mg helium 
star leaves a 2 Mg compact remnant (neutron star or black 
hole), (a) Main sequence phase; (b) beginning of the first 
mass transfer phase; (c) end of the first mass transfer 
phase; the first Wolf-Rayet phase begins; (d) the helium 
star (Wolf-Rayet star) has exploded as a supernova; (e) the 
23 Mg component becomes a supergiant; the compact 
component is a strong X-ray source; (f) beginning of the 
second mass transfer phase; the X-ray source is throttled 
and large-scale mass loss begins; (g) second Wolf-Rayet 
phase; (h) the 6 Mg helium star has exploded as a super¬ 
nova; the binary may or may not be disrupted, depending 
on the remaining mass 
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Fig. 12.12 The Wolf-Rayet star WR 104 photographed 
by the 10-m Keck telescope. The spiral is dust and gas 
which is thrown out from the rotating binary system. The 
spiral "pinwheel” is seen to make a full revolution in about 
220 days. (Photo U.C. Berkeley Space Sciences Labora¬ 
tory AV.M. Keck Observatory) 

still contracting. In intermediate age open clus¬ 
ters, the main sequence will be well developed 
and its upper end should bend to the right, since 
the most massive stars will already have begun to 
evolve off the main sequence. In the old globular 
clusters, the giant branch should increase in im¬ 
portance in the older clusters. These predictions 
are confirmed by the observations, which will be 
further discussed in Chap. 16 on star clusters. 

Of course, the most detailed observations can 
be made of the Sun, which is therefore a crucial 
point of comparison for the theoretical models. 
If a star of one solar mass with an initial com¬ 
position of 71 % hydrogen, 27 % helium and 
2 % heavier elements is allowed to evolve for 
5000 million years, it will be very similar to our 
present Sun. In particular, it will have the same 
radius, surface temperature and luminosity. Ac¬ 
cording to calculations, about half of the Sun's 
supply of hydrogen fuel has been consumed. The 
Sun will go on shining like a normal main se¬ 
quence star for another 5000 million years, before 
there will be any dramatic change. 

Some problems remain in regard to the obser¬ 
vations. One is the solar neutrino problem. The 


neutrinos produced by solar nuclear reactions 
have been observed since the beginning of the 
1970’s by the techniques described in Sect. 3.7. 
Only the neutrinos formed in the relatively rare 
ppIII reaction are energetic enough to be ob¬ 
served in this way. Their observed number is too 
small: whereas the models predict about 5 units, 
the observations have consistently only registered 
1 - 2 . 

The discrepancy may be due to a fault in 
the observational technique or to some unknown 
properties of the neutrinos. However, if the solar 
models are really in error, the central temperature 
of the Sun would have to be about 20 % lower 
than thought, which would be in serious conflict 
with the observed solar luminosity. One possibil¬ 
ity is that some of the electron neutrinos change 
to other, unobservable particles during their pas¬ 
sage to Earth. (See also Sect. 12.1.) 

A second problem is the observed abundance 
of lithium and beryllium. The solar surface con¬ 
tains a normal abundance of beryllium, but very 
little lithium. This should mean that during its 
contraction, the Sun was still fully convective 
when the central temperature was high enough 
to destroy lithium (3 x 10 6 K), but not beryl¬ 
lium (4 x 10 6 K). However, according to the stan¬ 
dard solar evolution models, convection ceased in 
the centre already at a temperature of 2 x 10 6 K. 
One suggested explanation is that the convection 
has later carried down lithium to layers where the 
temperature is high enough to destroy it. 


12.8 The Origin of the Elements 

There are just under a hundred naturally occur¬ 
ring elements, and about 300 isotopes in the solar 
system (Fig. 12.13). In Sect. 12.4, we have seen 
how the elements up to iron are produced when 
hydrogen burns to helium and helium further to 
carbon, oxygen and heavier elements. 

Almost all nuclei heavier than helium were 
produced in nuclear reactions in stellar interiors. 
In the oldest stars, the mass fraction of heavy 
elements is only about 0.02 %, whereas in the 
youngest stars it is a few per cent. Neverthe¬ 
less, most of the stellar material is hydrogen and 
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Fig. 12.13 Element 
abundances in the solar 
system as a function of the 
nuclear mass number. The 
abundance of Si has been 
normalised as 10 6 



helium. According to the standard cosmological 
model, those were formed in the early stages of 
the Universe, when the temperature and density 
were suitable for nuclear reactions. (This will be 
discussed in Chap. 20.) Although helium is pro¬ 
duced during the main sequence stellar evolution, 
very little of it is actually returned into space to be 
incorporated into later stellar generations. Most 
of it is either transformed into heavier elements 
by further reactions, or else remains locked up in¬ 
side white dwarf remnants. Therefore the helium 
abundance does not increase by much due to stel¬ 
lar processes. 

The most important nuclear reactions leading 
to the build-up of the heavy nuclei up to iron were 
presented in Sect. 11.3. The probabilities of the 
various reactions are determined either by exper¬ 
iments or by theoretical calculations. When they 


are known, the relative abundances of the various 
nuclei produced can be calculated. 

The formation of elements heavier than iron 
requires an input of energy, and thus they cannot 
be explained in the same manner. Still heavy nu¬ 
clei are continually produced. In 1952 technetium 
was discovered in the atmosphere of a red giant. 
The half-life of the most longlived isotope 98 Tc is 
about 1.5 x 10 6 years, so that the observed tech¬ 
netium must have been produced in the star. 

Most of the nuclei more massive than iron are 
formed by neutron capture (Fig. 12.14). Since 
the neutron does not have an electric charge, it 
can easily penetrate into the nucleus. The prob¬ 
ability for neutron capture depends both on the 
kinetic energy of the incoming neutron and on 
the mass number of the nucleus. For example, in 
the solar system the abundances of isotopes show 
maxima at the mass numbers A — 70-90, 130, 
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Fig. 12.14 Neutron capture paths for the s-process and 
r-process (from left to right). The s-process follows a path 
along the line of beta stability. The stable r-process nuclei 
(small circles ) result from beta decay of their neutron rich 
progenitors on the shaded path shown lower. Beta decay 
occurs along straight lines A = const. The closed neutron 

138, 195 and 208. These mass numbers cor¬ 
respond to nuclei with closed neutron shells at 
the neutron numbers N = 50, 82, and 126. The 
neutron capture probability for these nuclei is 
very small. The closed shell nuclei thus react 
more slowly and are accumulated in greater abun¬ 
dances. 

In a neutron capture, a nucleus with mass 
number A is transformed into a more massive nu¬ 
cleus: 

(Z, A) + n —> (Z, A+ l) + y. 

The newly formed nucleus may be unstable to 
fi decay, where one neutron is transformed into 
a proton: 

(Z, A + 1) —>- (Z + 1, A + 1) + e + v e . 

Two kinds of neutron capture processes are en¬ 
countered, depending on the value of the neutron 
flux. In the slow s-process, the neutron flux is 


shells in nuclei at N = 50, 82 and 126 correspond to abun¬ 
dance peaks in s-process nuclei at A = 88. 138 and 208, 
and in r-process nuclei at A = 80, 130 and 195. (Seeger, 
P.A., Fowler, W.A., Clayton, D.D. (1965): Astrophys. J. 
Suppl. 11, 121) 


so small that any fi decays have had time to oc¬ 
cur before the next neutron capture reaction takes 
place. The most stable nuclei up to mass num¬ 
ber 210 are formed by the s-process. These nu¬ 
clei are said to correspond to the fi stability val¬ 
ley. The s-process explains the abundance peaks 
at the mass numbers 88, 138 and 208. 

When the neutron flux is large, fi decays do 
not have time to happen before the next neu¬ 
tron capture. One then speaks of the rapid r- 
process, which gives rise to more neutron-rich 
isotopes. The abundance maxima produced by 
the r-process lie at mass numbers about ten units 
smaller than those of the s-process. 

A neutron flux sufficient for the s-process is 
obtained in the course of normal stellar evolu¬ 
tion. For example, some of the carbon and oxy¬ 
gen burning reactions produce free neutrons. If 
there is convection between the hydrogen and he¬ 
lium burning shells, free protons may be carried 
into the carbon-rich layers. Then the following 
neutron-producing reaction chain becomes im- 
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portant: 


12 c + p - 

» 13 N+k, 

13 N 

* 13 C + e+ + v e 

13 C+ 4 He- 

* 16 0 + n. 


The convection can also carry the reaction prod¬ 
ucts nearer to the surface. 

The neutron flux required for the r-process 
is about 10 22 cm -3 , which is too large to be 
produced during normal stellar evolution. The 
only presently known site where a large enough 
neutron flux is expected is near a neutron star 
forming in a supernova explosion. In this case, 
the rapid neutron capture leads to nuclei that 
cannot capture more neutrons without becoming 
strongly unstable. After one or more rapid ( J > de¬ 
cays, the process continues. 

The r-process stops when the neutron flux de¬ 
creases. The nuclei produced then gradually de¬ 
cay by the /J-process towards more stable iso¬ 
topes. Since the path of the r-process goes about 
ten mass units below the stability valley, the abun¬ 
dance peaks produced will fall about ten units 
below those of the s-process. This is shown in 
Fig. 12.11. The most massive naturally occurring 
elements, such as uranium, thorium and pluto¬ 
nium, are formed by the r-process. 

There are about 40 isotopes on the proton-rich 
side of the f3 stability valley that cannot be pro¬ 
duced by neutron capture processes. Their abun¬ 
dances are very small, relative to the neighbour¬ 
ing isotopes. They are formed in supernova ex¬ 
plosions at temperatures higher than 10 9 K by re¬ 
actions known as the p-process. At this tempera¬ 
ture, pair formation can take place: 

Y -*e + + e“. 

The positron may either be annihilated immedi¬ 
ately or be consumed in the reaction 

e + + (Z, A) (Z + 1, A) + v e . 


the isotopes 184 W, 190 Pt and 196 Hg formed by the 
fission of lead. 

All the preceding reaction products are ejected 
into the interstellar medium in the supernova 
explosion. Collisions between cosmic rays and 
heavy nuclei then finally give rise to the light el¬ 
ements lithium, beryllium and boron. Thus the 
abundances of essentially all naturally occurring 
isotopes can be explained. 

During succeeding generations of stars the rel¬ 
ative abundance of heavy elements increases in 
the interstellar medium. They can then be incor¬ 
porated into new stars, planets—and living be¬ 
ings. 


12.9 Example 


Example 12.1 An interstellar cloud has a mass 
of one solar mass and density of 10 10 hydrogen 
atoms per cm 3 . Its rotation period is 1000 years. 
What is the rotation period after the cloud has 
condensed into a star of solar size? 

The angular momentum is L — Iu >, where to is 
the angular velocity and I is the moment of iner¬ 
tia. For a homogeneous sphere 

2 , 

I = -MR 2 , 

5 

where M is the mass and R the radius. From the 
conservation of the angular momentum we get 


L — I\oo\ — I2CO2 

I\ 2 n I22 tc 


P\ 


Pi 


h \mr 2 

=> P2 = P\f = P\\ - \ = P\ 

h § MR 2 


where P\ and Pj are the rotation periods before 
and after the collapse. The mass of the cloud is 


Another reaction in the p-process is 

(Z, A) + p —> (Z + 1, A + 1) + y. 

Finally, the fission of some heavier isotopes may 
give rise to p-process nuclei. Examples of this are 


4 1 

M = -n R 2 p 
3 

= -itR 3 x 10 16 x 1.6734 x 10“ 27 kg 
3 

= 1 Mq — 1.989 x 10 30 kg. 
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Solving for the radius we get R = 3 x 10 13 m. 
The rotation period after the collapse is 


P 2 = 1000 a x 


/6.96 x 10 s m\ 2 
V 3 x 10 13 m ) 


— 5.4 x 10 -7 a = 17 s. 


This is several orders of magnitude shorter than 
the actual period. Somehow the star has to get rid 
of most of its angular momentum during the pro¬ 
cess. 


12.10 Exercises 

Exercise 12.1 Find the free fall time scale for 
a hydrogen cloud, if the density of Fb molecules 
is 3000 cm -3 . Assume that stars condense from 
such clouds, there are 100 clouds in the Galaxy, 


the mass of each cloud is 5 x 10 4 Mq, and 10 % 
of the mass is converted into stars. Also assume 
that the average mass of a star is 1 Mq. How 
many stars are born in one year? 

Exercise 12.2 The mass of Vega (spectral class 
A0V) is 2 Mq, radius 3 Rq, and luminosity 
60 Lq. Find its thermal and nuclear time scales. 

Exercise 12.3 Assume that a star remains 10 9 
years in the main sequence and burns 10 % of its 
hydrogen. Then the star will expand into a red gi¬ 
ant, and its luminosity will increase by a factor 
of 100. How long is the red giant stage, if we as¬ 
sume that the energy is produced only by burning 
the remaining hydrogen? 
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The Sun is our nearest star. It is important for 
astronomy because many phenomena which can 
only be studied indirectly in other stars can be di¬ 
rectly observed in the Sun (e.g. stellar rotation, 
starspots, the structure of the stellar surface). Our 
present picture of the Sun is based both on ob¬ 
servations and on theoretical calculations. Some 
observations of the Sun disagree with the theoret¬ 
ical solar models. The details of the models will 
have to be changed, but the general picture should 
remain valid. 


13.1 Internal Structure 

The Sun is a typical main sequence star. Its prin¬ 
cipal properties are: 


Mass 

m = Mg 

= 

1.989 x 10 30 kg 

Radius 

R = Rq 

= 

6.960 x 10 8 m 

Mean density 

P 

= 

1409 kg/m 3 

Central density 

Pc 

= 

1.6 x 10 5 kg/m 3 

Luminosity 

o 

II 

= 

3.9 x 10 26 W 

Effective 

temperature 

T e 

= 

5785 K 

Central temperature 

T c 

= 

1.5 x 10 7 K 

Absolute 

bolometric 

magnitude 

M\yoi 

" 

4.72 

Absolute visual 
magnitude 

My 

= 

4.79 

Spectral class 



G2V 

Colour indices 

B - V 

= 

0.62 


U-B 

= 

0.10 


Surface 

X 

= 

0.71 

chemical 

composition 

Y 

= 

0.27 

Z 

= 

0.02 


Rotational period 


at the equator 



25 d 

at latitude 60° 



29 d 



Fig. 13.1 The distribution of temperature, pressure, en¬ 
ergy production and mass as functions of radius in the Sun 

On the basis of these data, the solar model 
shown in Fig. 13.1 has been calculated. The en¬ 
ergy is produced by the pp chain in a small cen¬ 
tral region. 99 % of the solar energy is produced 
within a quarter of the solar radius. 

The Sun produces energy at the rate of 4 x 
10 26 W, which is equivalent to changing about 
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Fig. 13.2 The interior and 
surface of the Sun. The 
various kinds of solar 
phenomena are 
schematically indicated. 
(Based on Van Zandt, R.P. 
(1977): Astronomy for the 
Amateur, Planetary 
Astronomy, Vol. 1, 3rd edn. 
(published by the author, 
Peoria, III.)) 



four million tonnes of mass into energy every 
second. The mass of the Sun is so large, about 
330,000 times that of the Earth, that during the 
whole main sequence lifetime of the Sun less than 
0.1 % of its mass is turned into energy. 

When the Sun formed about 5000 million 
years ago, its composition was the same every¬ 
where as its present surface composition. Since 
energy production is concentrated at the very cen¬ 
tre, hydrogen is consumed most rapidly there. At 
about a quarter of the radius the hydrogen abun¬ 
dance is still the same as in the surface layers, 
but going inwards from that point it rapidly de¬ 
creases. In the central core only 40 % of the ma¬ 
terial is hydrogen. About 5 % of the hydrogen in 
the Sun has been turned into helium. 

The radiative central part of the Sun extends 
to about 70 % of the radius. At that radius the 
temperature has dropped so much that the gas 
is no longer completely ionised. The opacity of 
the solar material then strongly increases, inhibit¬ 


ing the propagation of radiation. In consequence, 
convection becomes a more efficient means of en¬ 
ergy transport. Thus the Sun has a convective en¬ 
velope (Fig. 13.2). 

The Solar Neutrino Problem The central nu¬ 
clear reactions produce neutrinos at several of the 
steps in the pp chain (see Fig. 11.5). These neutri¬ 
nos can propagate freely through the outer layers, 
and thus give direct information about conditions 
near the centre of the Sun. When neutrinos from 
the Sun were first observed in the 1970’s, their 
number was found to be only about a third of 
what was predicted. This disagreement is called 
the solar neutrino problem. 

In the first experiments only neutrinos from 
the ppll and ppIII branches were observed 
(Sect. 11.3). Since only a small fraction of the 
solar luminosity is produced in these reactions, 
it was not clear what were the consequences 
of these results for solar models. In the 1990’s 
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neutrinos produced in the ppl branch, the main 
branch of the pp chain, were observed. Although 
the disagreement with the standard models was 
slightly smaller in these observations (about 60 % 
of the predicted flux was observed), the neutrino 
problem still remained. 

Perhaps the most popular explanation for the 
solar neutrino problem was based on neutrino 
oscillations. According to this explanation, if 
neutrinos have a small mass (about 10 -2 eV), 
an electron neutrino could change into a /x or 
a r neutrino as it passed through the outer parts 
of the Sun. In the early experiments only electron 
neutrinos were observed, representing only part 
of the total number of neutrinos produced. 

In 2001 the Canadian Sudbury neutrino obser¬ 
vatory (SNO) announced results that seemed to 
solve the problem. The SNO can detect the flux of 
all different neutrinos and the fraction of the elec¬ 
tron neutrinos. The measurements showed that 
the total neutrino flux was indeed consistent with 
the predictions of the solar models, but only 35 % 
of the flux consisted of electron neutrinos. Thus 
65 % of the solar electron neutrinos had changed 
to /x or r neutrinos while travelling from the Sun 
to the Earth. Similar observations were also made 
in a Japanese neutrino observatory. Among the 
pioneers of the neutrino astronomy are Raymond 
Davis and Masatoshi Koshiba who shared the No¬ 
bel prize in physics in 2002. 

The solar neutrino problem can now be con¬ 
sidered to be solved. The solution is a great suc¬ 
cess for the standard solar model. But it has also 
revealed the existence of neutrino oscillations, 
proving that neutrinos have a small but non-zero 
rest mass. This shows that the standard model of 
particle physics needs to be revised in some re¬ 
spects. 

The Solar Rotation As soon as telescopes 
were introduced, it was observed from the mo¬ 
tions of sunspots that the Sun is rotating with 
a rotational period of about 27 days. As early as 
1630 Christoph Schemer showed that there was 
differential rotation', the rotational period near the 
poles was more than 30 days, while it was only 
25 days at the equator. The rotational axis of the 


Sun is inclined at 7° with respect to the plane of 
the ecliptic, so that the North Pole of the Sun is 
best visible from the Earth in September. 

The motions of sunspots still give the best in¬ 
formation on the rotation near the surface of the 
Sun. Other surface features also have been used 
for this purpose. The rotational velocity has also 
been measured directly from the Doppler effect. 
The angular velocity is usually written 

Q = A - B sin 2 f, (13.1) 

where is the latitude with respect to the equa¬ 
tor. The measured values of the coefficients are 
A — 14.5 and B — 2.9 degrees/day. 

The rotational velocity deeper down in the 
Sun cannot be directly observed. In the 1980’s 
a method to estimate the rotation in the interior 
became available, when it became possible to 
measure the frequencies of solar oscillations from 
the variations in spectral lines. These oscillations 
are essentially sound waves produced by turbu¬ 
lent gas motions in the convection zone. These 
sound waves have calculable oscillation periods 
(about 3-12 minutes), which depend on the con¬ 
ditions in the solar interior. By comparing the ob¬ 
served and theoretical values one can get infor¬ 
mation about the conditions deep inside the Sun. 
The idea of the method is the same as that used 
when studying the interior of the Earth by means 
of waves from earthquakes, and it is therefore 
called helioseismology. 

Using helioseismology, models for the so¬ 
lar rotation throughout the convection zone have 
been deduced. It appears that the angular velocity 
in the whole convection zone is almost the same 
as at the surface, although it decreases slightly 
with radius near the equator, and increases near 
the poles. The angular velocity of the radiative 
core is still uncertain, but there are indications 
that the core is rotating as a solid body with ap¬ 
proximately the average surface angular velocity. 
At the bottom of the convection zone there is a 
thin layer known as the tachocline, where the an¬ 
gular velocity changes rapidly with radius. The 
internal solar rotation according to the helioseis- 
mological studies is shown in Fig. 13.3. 
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The solar differential rotation is maintained by 
gas motions in the convection zone. Explaining 
the observed behaviour is a difficult problem that 
is not yet completely understood. 


13.2 The Atmosphere 

The solar atmosphere is divided into the photo¬ 
sphere and the chromosphere. Outside the actual 
atmosphere, the corona extends much further out¬ 
wards. 

The Photosphere The innermost layer of the 
atmosphere is the photosphere, which is only 
about 300-500 km thick. The photosphere is 
the visible surface of the Sun, where the den¬ 
sity rapidly increases inwards, hiding the interior 
from sight. The temperature at the inner bound¬ 
ary of the photosphere is 8000 K and at the outer 



Fig. 13.3 The rotation rate of the Sun inferred from he- 
lioseismological observations. The equator is at the hori¬ 
zontal axis and the pole is at the vertical axis, both axes 
being labelled by fractional radius. Some contours are la¬ 
belled in nHz, and, for clarity, selected contours are shown 
as bold. (430 nHz is about 26.9 days.) The dashed circle is 
at the base of the convection zone and the tick marks at the 
edge of the outer circle are at latitudes 15°, 30°, 45°, 60°, 
75°. The shaded area indicates the region in the Sun where 
no reliable inference can be made with present data. The 
slanted dotted lines are at an angle of 27° with the rotation 
axis. (Adapted from Schou et al. 1998.) (J. Christensen- 
Dalsgaard 2007, astro-ph/0610942, Fig. 2) 


boundary 4500 K. Near the edge of the solar 
disk, the line of sight enters the photosphere at 
a very small angle and never penetrates to large 
depths. Near the edges one therefore only sees 
light from the cooler, higher layers. For this rea¬ 
son, the edges appear darker; this phenomenon 
is known as limb darkening. Both the continuous 
spectrum and the absorption lines are formed in 
the photosphere, but the light in the absorption 
lines comes from higher layers and therefore the 
lines appear dark. 

The solar convection is visible on the surface 
as the granulation (Fig. 13.4), an uneven, con¬ 
stantly changing granular pattern. At the bright 
centre of each granule, gas is rising upward, and 
at the darker granule boundaries, it is sinking 
down again. The size of a granule seen from 
the Earth is typically 1", corresponding to about 
1000 km on the solar surface. There is also 
a larger scale convection called supergranulation 
in the photosphere. The cells of the supergranu¬ 
lation may be about F in diameter. The observed 
velocities in the supergranulation are mainly di¬ 
rected along the solar surface. 

The Chromosphere Outside the photosphere 
there is a layer, perhaps about 500 km thick, 
where the temperature increases from 4500 K to 
about 6000 K, the chromosphere. Outside this 
layer, there is a transition region of a few thou¬ 
sand kilometres, where the chromosphere gradu¬ 
ally goes over into the corona. In the outer parts 
of the transition region, the kinetic temperature is 
already about 10 6 K. 

Normally the chromosphere is not visible, be¬ 
cause its radiation is so much weaker than that 
of the photosphere. However, during total solar 
eclipses, the chromosphere shines into view for 
a few seconds at both ends of the total phase, 
when the Moon hides the photosphere com¬ 
pletely. The chromosphere then appears as a thin 
reddish sickle or ring. 

During eclipses the chromospheric spectrum, 
called the flash spectrum, can be observed 
(Fig. 13.5). It is an emission line spectrum with 
more than 3000 identified lines. Brightest among 
these are the lines of hydrogen, helium and cer¬ 
tain metals. 
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Fig. 13.4 The granulation 
of the solar surface. The 
granules are produced by 
streaming gas. Their 
typical diameter is 
1000 km. The picture was 
taken in May 13, 2005 with 
the Swedish one metre 
solar telescope on La 
Palma. (Photograph Tom 
Berger, Royal Swedish 
Academy of Sciences, 
ISP/RSAS) 



Fig. 13.5 Flash spectrum 
of the solar chromosphere, 
showing bright emission 
lines 



One of the strongest chromospheric emission 
lines is the hydrogen Balmer a line (Fig. 13.6) 
at a wavelength of 656.3 nm. Since the H„ line 
in the normal solar spectrum is a very dark ab¬ 
sorption line, a photograph taken at this wave¬ 
length will show the solar chromosphere. For 
this purpose, one uses narrow-band filters letting 
through only the light in the H„ line. The re¬ 
sulting pictures show the solar surface as a mot¬ 
tled, wavy disk. The bright regions are usually 
the size of a supergranule, and are bounded by 
spicules (Fig. 13.7). These are flamelike struc¬ 
tures rising up to 10,000 km above the chromo¬ 
sphere, and lasting for a few minutes. Against 


the bright surface of the Sun, they look like 
dark streaks; at the edges, they look like bright 
flames. 

The Corona The chromosphere gradually goes 
over into the corona. The corona is also best seen 
during total solar eclipses (Fig. 13.8). It then ap¬ 
pears as a halo of light extending out to a few 
solar radii. The surface brightness of the corona 
is about that of the full moon, and it is therefore 
difficult to see next to the bright photosphere. 

The inner part of the corona, the K corona, 
has a continuous spectrum formed by the scatter¬ 
ing of the photospheric light by electrons. Fur- 
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Fig. 13.6 The solar 
surface in the hydrogen H a 
line. Active regions appear 
bright; the dark filaments 
are prominences. Limb 
darkening has been 
removed artificially, which 
brings to light spicules and 
prominences above the 
limb. The photograph was 
taken in October 1997. 
(Photograph Big Bear 
Solar Observatory/NJIT) 



Fig. 13.7 Spicules, 
flamelike uprisings near the 
edge of the solar disc. 
(Photograph Big Bear 
Solar Observatory) 



ther out, a few solar radii from the surface, is the 
F corona, which has a spectrum showing Fraun¬ 
hofer absorption lines. The light of the F corona 
is sunlight scattered by dust. 

In the latter part of the 19th century strong 
emission lines, which did not correspond to those 
of any known element, were discovered in the 


corona (Fig. 13.9). It was thought that a new ele¬ 
ment, called coronium, had been found—a little 
earlier, helium had been discovered in the Sun 
before it was known on Earth. About 1940, it 
was established that the coronal lines were due to 
highly ionised atoms, e.g. thirteen times ionised 
iron. Much energy is needed to remove so many 
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Fig. 13.8 Previously, the 
corona could be studied 
only during total solar 
eclipses. The picture is 
from the eclipse on 
March 7, 1970. Nowadays 
the corona can be studied 
continuously using 
a device called the 
coronagraph 



Fig. 13.9 The presence of 
lines from highly ionised 
atoms in the coronal 
spectrum shows that the 
temperature of the corona 
has to be very high 



electrons from the atoms. The entire corona has 
to have a temperature of about a million de¬ 
grees. 

A continuous supply of energy is needed in 
order to maintain the high temperature of the 
corona. According to earlier theories, the energy 
came in the form of acoustic or magnetohydrody¬ 
namic shock waves generated at the solar surface 
by the convection. Most recently, heating by elec¬ 


tric currents induced by changing magnetic fields 
has been suggested. Heat would then be gener¬ 
ated in the corona almost like in an ordinary light 
bulb. 

In spite of its high temperature the coronal gas 
is so diffuse that the total energy stored in it is 
small. It is constantly streaming outwards, gradu¬ 
ally becoming a solar wind, which carries a flux 
of particles away from the Sun. The gas lost in 
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Fig. 13.10 The sunspots 
are the form of solar 
activity that has been 
known for the longest time. 
The photograph was taken 
with the Swedish 1-meter 
Solar Telescope in July 
2002. (Photograph Royal 
Swedish Academy of 
Sciences) 



this way is replaced with new material from the 
chromosphere. 


13.3 Solar Activity 

Sunspots The clearest visible sign of solar ac¬ 
tivity are the sunspots. The existence of sunspots 
has been known for long (Fig. 13.10), since the 
largest ones can be seen with the naked eye by 
looking at the Sun through a suitably dense layer 
of fog. More precise observations became avail¬ 
able beginning in the 17th century, when Galilei 
started to use the telescope for astronomical ob¬ 
servations. 

A sunspot looks like a ragged hole in the solar 
surface. In the interior of the spot there is a dark 
umbra and around it, a less dark penumbra. By 
looking at spots near the edge of the solar disk, it 
can be seen that the spots are slightly depressed 
with respect to the rest of the surface. The surface 
temperature in a sunspot is about 1500 K below 


that of its surroundings, which explains the dark 
colour of the spots. 

The diameter of a typical sunspot is about 
10,000 km and its lifetime is from a few days to 
several months, depending on its size. The larger 
spots are more likely to be long-lived. Sunspots 
often occur in pairs or in larger groups. By fol¬ 
lowing the motions of the spots, the period of ro¬ 
tation of the Sun can be determined. 

The variations in the number of sunspots 
have been followed for almost 250 years. The 
frequency of spots is described by the Zurich 
sunspot number Z: 

Z = C(S + 10G), (13.2) 

where S is the number of spots and G the num¬ 
ber of spot groups visible at a particular time. C is 
a constant depending on the observer and the con¬ 
ditions of observation. 

In Fig. 13.11, the variations in the Zurich 
sunspot number between the 18th century and 
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Fig. 13.11 The Zurich sunspot number from 1700 to 2001. Prior to 1700 there are only occasional observations. The 
number of sunspots and spot groups varies with a period of about 11 years 



Fig. 13.12 Left'. During a sunspot maximum (Sept 27, 2001) the Sun is dotted by numerous spots. Right'. Exactly seven 
years later no spots are seen, although minimum phase should already be over. (Photos SOHO/MIDI) 


the present are shown. Evidently the number of 
spots varies with an average period of about 
11 years. The actual period may be between 7 
and 17 years. In the past decades, it has been 
about 10.5 years. Usually the activity rises to its 
maximum in about 3-4 years, and then falls off 
slightly more slowly (Fig. 13.12). The period was 
first noted by Samuel Heinrich Schwabe in 1843. 

The variations in the number of sunspots have 
been fairly regular since the beginning of the 
18th century. However, in the 17th century there 
were long intervals when there were essentially 
no spots at all. This quiescent period is called the 
Maunder minimum. The similar Sporer minimum 
occurred in the 15th century, and other quiet in¬ 
tervals have been inferred at earlier epochs. The 
mechanism behind these irregular variations in 
solar activity is not yet understood. 


The magnetic fields in sunspots are measured 
on the basis of the Zeeman effect, and may be 
as large as 0.45 tesla. (The magnetic field of 
the Earth at the equator is 0.03 mT.) The strong 
magnetic field inhibits convective energy trans¬ 
port, which explains the lower temperature of the 
spots. 

Sunspots often occur in pairs where the com¬ 
ponents have opposite polarity. The structure of 
such bipolar groups can be understood if the field 
rises into a loop above the solar surface, connect¬ 
ing the components of the pair. If gas is streaming 
along such a loop, it becomes visible as a loop 
prominence (Fig. 13.13). 

The periodic variation in the number of sun¬ 
spots reflects a variation in the general solar mag¬ 
netic field. At the beginning of a new activity cy¬ 
cle spots first begin to appear at latitudes of about 
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Fig. 13.13 In pairs of 
sunspots the magnetic field 
lines form a loop outside 
the solar surface. Material 
streaming along the field 
lines may form loop 
prominences. Loops of 
different size can be seen 
in this image, which the 
Trace satellite took in 
1999. (Photo Trace) 



Fig. 13.14 At the 

beginning of an activity 
cycle, sunspots appear at 
high latitudes. As the cycle 
advances the spots move 
towards the equator. 
(Diagram by H. Virtanen, 
based on Greenwich 
Observatory observations) 



±40°. As the cycle advances, the spots move 
closer to the equator. The characteristic pattern 
in which spots appear, shown in Fig. 13.14, is 
known as the butterfly diagram. Spots of the next 
cycle begin to appear while those of the old one 
are still present near the equator. Spots belonging 
to the new cycle have a polarity opposite to that 
of the old ones. (Spots in opposite hemispheres 
also have opposite polarity.) Since the field is 
thus reversed between consecutive 11 year cycles 
the complete period of solar magnetic activity is 
22 years. 

The following general qualitative description 
of the mechanism of the solar cycle was pro¬ 
posed by Horace W. Babcock. Starting at a so¬ 
lar minimum, the field will be of a generally 
dipolar character. Because a conducting medium, 
such as the outer layers of the Sun, cannot move 
across the field lines, these will be frozen into the 


plasma and carried along by it. Thus the differ¬ 
ential rotation will draw the field into a tight spi¬ 
ral (Fig. 13.15). In the process the field becomes 
stronger, and this amplification will be a function 
of latitude. 

When the subsurface field becomes strong 
enough, it gives rise to a “magnetic buoyancy” 
that lifts ropes of magnetic flux above the sur¬ 
face. This happens first at a latitude about 40°, 
and later at lower latitudes. These protruding flux 
ropes expand into loops forming bipolar groups 
of spots. As the loops continue expanding they 
make contact with the general dipolar field, which 
still remains in the polar regions. This leads to 
a rapid reconnection of the field lines neutralis¬ 
ing the general field. The final result when ac¬ 
tivity subsides is a dipolar field with a polarity 
opposite the initial one. 
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Fig. 13.15 Because the Sun rotates faster at the equator than at the poles, the field lines of the solar magnetic field are 
drawn out into a tight spiral 



Fig. 13.16 A quiet Sun in August 2006 around the 
last sunspot minimum. Both pictures were taken by the 
Michelson Doppler Imager on the SOHO satellite. On the 

Thus the Babcock model accounts for the but¬ 
terfly diagram, the formation of bipolar magnetic 
regions and the general field reversal between ac¬ 
tivity maxima. Nevertheless, it remains an essen¬ 
tially phenomenological model, and alternative 
scenarios have been proposed. In dynamo the¬ 
ory quantitative models for the origin of mag¬ 
netic fields in the Sun and other celestial bodies 
are studied. In these models the field is produced 
by convection and differential rotation of the gas. 
A completely satisfactory dynamo model for the 
solar magnetic cycle has not yet been found. For 
example, it is not yet known whether the field is 
produced everywhere in the convection zone, or 
just in the boundary layer between the convective 


left the Sun in visible light, on the right a magnetogram, 
which shows the opposite polarities of the magnetic fields 
as black and white. (Photo SOHO/NASA/ESA) 

and radiative regions, as some indications sug¬ 
gest. 

Other Activity The Sun shows several other 
types of surface activity: faculae and plages', 
prominences', flares. 

The faculae and plages are local bright regions 
in the photosphere and chromosphere, respec¬ 
tively. Observations of the plages are made in the 
hydrogen H„ or the calcium K lines (Fig. 13.16). 
The plages usually occur where new sunspots 
are forming, and disappear when the spots disap¬ 
pear. Apparently they are caused by the enhanced 
heating of the chromosphere in strong magnetic 
fields. 
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Fig. 13.17 (a) Quiescent “hedgerow” prominence. (Photograph Sacramento Peak Observatory.) (b) Larger eruptive 
prominence. (Photograph Big Bear Solar Observatory) 


The prominences are among the most spec¬ 
tacular solar phenomena. They are glowing gas 
masses in the corona, easily observed near the 
edge of the Sun. There are several types of promi¬ 
nences (Fig. 13.17): the quiescent prominences, 
where the gas is slowly sinking along the mag¬ 
netic field lines; loop prominences, connected 
with magnetic field loops in sunspots; and the 


rarer eruptive prominences, where gas is violently 
thrown outwards. 

The temperature of prominences is about 
10,000-20,000 K. In H, y photographs of the chro¬ 
mosphere, the prominences appear as dark fila¬ 
ments against the solar surface (Fig. 13.6). 

The flare outbursts are among the most violent 
forms of solar activity (Fig. 13.18). They appear 



13.3 Solar Activity 


295 


Fig. 13.18 A violent flare 
near some small sunspots. 
(Photograph Sacramento 
Peak Observatory) 



as bright flashes, lasting from one second to just 
under an hour. In the flares a large amount of en¬ 
ergy stored in the magnetic field is suddenly re¬ 
leased. The detailed mechanism is not yet known. 

Flares can be observed at all wavelengths. The 
hard X-ray emission of the Sun may increase hun¬ 
dredfold during a flare. Several different types 
of flares are observed at radio wavelengths. The 
emission of solar cosmic ray particles also rises. 

Prominences and flares are often accompa¬ 
nied with coronal mass ejections Fast moving 
(500-2000 km/s) clouds cause shock waves ac¬ 
celerating particles to very high velocities. Parti¬ 
cles in flares are the fastest (v ss 0.3c). Particles in 
a magnetic cloud travelling at the velocity of the 
mass ejection reach the Earth in a couple of days. 
Particles activated by the shock wave arrive con¬ 
tinuously affecting the space weather and causing 
magnetic storms. 

The flares and coronal mass ejections give rise 
to disturbances on the Earth. The X-rays cause 
changes in the ionosphere, which affect short¬ 
wave radio communications. The particles give 
rise to strong auroras when they enter the Earth’s 
magnetic field a few days after the outburst. 

Solar Radio Emission The Sun is the strongest 
radio source in the sky and has been observed 



Fig. 13.19 An X-ray picture of the active Sun, taken by 
the Japanese Yohkoh satellite in 1999, around the last 
maximum of sunspot activity. (Photo JAXA) 

since the 1940’s. In contrast to optical emis¬ 
sion the radio picture of the Sun shows a strong 
limb brightening. This is because the radio ra¬ 
diation comes from the upper layers of the at¬ 
mosphere. Since the propagation of radio waves 
is obstructed by free electrons, the high electron 
density near the surface prevents radio radiation 
from getting out. Shorter wavelengths can propa- 
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Fig. 13.20 The SOHO 
(Solar and Heliospheric 
Observatory) satellite 
keeps a constant watch on 
the Sun and its 
surroundings in many 
wavelengths. Here the 
LASCO (Large Angle and 
Spectrometric 
Coronagraph) instrument 
sees a large Coronal Mass 
Ejection erupting from the 
Sun. The surface of the 
Sun is covered by a disk, 
and the size and position of 
the Sun is indicated by the 
white circle. (Photo 
SOHO/NASA/ESA) 
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gate more easily, and thus millimetre-wavelength 
observations give a picture of deeper layers in the 
atmosphere, whereas the long wavelengths show 
the upper layers. (The 10 cm emission originates 
in the upper layers of the chromosphere and the 
1 m emission, in the corona.) 

The Sun looks different at different wave¬ 
lengths. At long wavelengths the radiation is 
coming from the largest area, and its electron 
temperature is about 10 6 K, since it originates 
in the corona. 

The radio emission of the Sun is constantly 
changing according to solar activity. During large 
storms the total emission may be 100,000 times 
higher than normal. Especially the motion of 
shock waves can be followed by the radio emis¬ 
sion, since the electrons accelerated by the shock 
generate radio emission (type II radio bursts). 

X-ray and UY Radiation The X-ray emis¬ 
sion of the Sun is also related to active regions 
(Fig. 13.19). Signs of activity are bright X-ray re¬ 
gions and smaller X-ray bright points , which last 
for around ten hours. The inner solar corona also 


emits X-rays. Near the solar poles there are coro¬ 
nal holes , where the X-ray emission is weak. 

Ultraviolet pictures of the solar surface show it 
as much more irregular than it appears in visible 
light. Most of the surface does not emit much UV 
radiation, but there are large active regions that 
are very bright in the ultraviolet. 

Several satellites have made observations of 
the Sun at UV and X-ray wavelengths, for ex¬ 
ample Soho (Solar and Heliospheric Observa¬ 
tory, 1995-, Fig. 13.20). These observations have 
made possible detailed studies of the outer lay¬ 
ers of the Sun. Observations of other stars have 
revealed coronae, chromospheres and magnetic 
variations similar to those in the Sun. Thus the 
new observational techniques have brought the 
physics of the Sun and the stars nearer to each 
other. 


13.4 Solar Wind and Space Weather 

A continuous stream of charged particles is com¬ 
ing from the Sun as the solar wind, varying 
with the solar activity. The coronal mass ejec- 
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tions are considered as “disturbances”. Around a 
minimum mass ejections may occur a few times 
a week; around a maximum there may be sev¬ 
eral ejections every day. The solar wind consists 
mainly of electrons and protons. At the distance 
of the Earth there are typically 5-10 particles per 
cubic centimetre. There are also some nuclei of 
helium atoms. The velocity is the solar wind close 
to the poles of the Sun is about 800 km/s but 
around the equator only about 300 km/s. At the 
distance of the Earth the velocity of the particles 
is about 500 km/s on the average. The Sun loses 
2 — 3 x 10 -14 Mq of its mass every year as solar 
wind. 

Magnetic fields of planets direct the motions 
of the particles of the solar wind (Sect. 7.7). The 
auroras are the most outstanding phenomenon on 
the Earth. Currents induced by the charged parti¬ 
cles can also have considerable negative effects. 
They can damage satellites and even electric net¬ 
works. The most serious accident this far hap¬ 
pened in 1989 in Quebec when a high voltage net¬ 
work was damaged and millions of people had to 
survive without electricity for several hours. 

Space weather mean the interaction of the so¬ 
lar wind and the magnetic environment of the 
Earth. Because of its effects it is now followed 
actively. The space weather affects the upper at¬ 
mosphere but it is still debated whether it is con¬ 
nected to variations of the weather and climate. 


13.5 Example 

Example 13.1 Assume that the Sun converts 
0.8 % of its mass into energy. Find an upper limit 


for the age of the Sun, assuming that its luminos¬ 
ity has remained constant. 

The total amount of energy released is 

E — me 1 = 0.008 Mqc 2 
= 0.008 x 2 x 10 30 kg x (3 x 10 8 ms -1 ) 2 
= 1.4 x 10 45 J. 

The time needed to radiate this energy is 

E _ 1.4 x 10 45 J 
r “ “ 3.9 X 10 26 W 

= 3.6 x 10 18 s^ 10 11 years. 

13.6 Exercises 

Exercise 13.1 The solar constant, i.e. the flux 
density of the solar radiation at the distance of 
the Earth is 1370 W m -2 . 

(a) Find the flux density on the surface of the 
Sun, when the apparent diameter of the Sun 
is 32'. 

(b) How many square metres of solar surface is 
needed to produce 1000 megawatts? 

Exercise 13.2 Some theories have assumed that 
the effective temperature of the Sun 4.5 billion 
years ago was 5000 K and radius 1.02 times 
the current radius. What was the solar constant 
then? Assume that the orbit of the Earth has not 
changed. 
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Stars with changing magnitudes are called vari¬ 
ables (Fig. 14.1). Variations in the brightness of 
stars were first noted in Europe at the end of the 
16th century, when Tycho Brahe’s supernova lit 
up (1572) and the regular light variation of the 
star o Ceti (Mira) was observed (1596). The num¬ 
ber of known variables has grown steadily as ob¬ 
servational precision has improved (Fig. 14.2). 
The most recent catalogues contain about 40,000 
stars known or suspected to be variable. 

Strictly speaking, all stars are variable. As 
was seen in Chap. 12, the structure and bright¬ 
ness of a star change as it evolves. Although 
these changes are usually slow, some evolution¬ 
ary phases may be extremely rapid. In certain 
evolutionary stages, there will also be periodic 
variations, for example pulsations of the outer 
layers of a star. 

Small variations in stellar brightness are also 
caused by hot and cool spots on a star’s surface, 
appearing and disappearing as it rotates about its 
axis. The luminosity of the Sun changes slightly 
because of the sunspots. Probably there are simi¬ 
lar spots on almost all stars. 

Initially stellar brightnesses were determined 
visually by comparing stars near each other. Later 
on, comparisons were made on photographic 
plates. At present the most accurate observations 
are made photoelectrically or using a CCD cam¬ 
era. The magnitude variation as a function of time 
is called the lightcurve of a star (Fig. 14.3). From 
it one obtains the amplitude of the magnitude 
variation and its period, if the variation is peri¬ 
odic. 


The basic reference catalogue of variable stars 
is the General Catalogue of Variable Stars by the 
Soviet astronomer Boris Vasilyevich Kukarkin. 
New, supplemented editions appear at times; the 
fourth edition published in 1985-1987, edited by 
P.N. Kholopov, contains about 32,000 variables 
of the Milky Way galaxy. 


14.1 Classification 

When a new variable is discovered, it is given 
a name according to the constellation in which 
it is located. The name of the first variable in 
a given constellation is R, followed by the name 
of the constellation (in the genitive case). The 
symbol for the second variable is S, and so on, 
to Z. After these, the two-letter symbols RR, 
RS,... to ZZ are used, and then A A to QZ (omit¬ 
ting I). This is only enough for 334 variables, 
a number that has long been exceeded in most 
constellations. The numbering therefore contin¬ 
ues: V335, V336, etc. (V stands for variable). 
For some stars the established Greek letter sym¬ 
bol has been kept, although they have later been 
found to be variable (e.g. S Cephei). 

The classification of variables is based on the 
shape of the lightcurve, and on the spectral class 
and observed radial motions. The spectrum may 
also contain dark absorption lines from material 
around the star. Observations can be made out¬ 
side the optical region as well. Thus the radio 
emission of some variables (e.g. flare stars) in¬ 
creases strongly, simultaneously with their opti¬ 
cal brightness. Examples of radio and X-ray vari- 
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Fig. 14.1 The variables 
are stars changing in 
brightness. Two variables 
in Scorpius, R and S Sco. 
(Photograph Yerkes 
Observatory) 



Fig. 14.2 The location of 
variables in the HR 
diagram 



ables are the radio and X-ray pulsars, and the Variables are usually divided into three main 
X-ray bursters. types: pulsating, eruptive and eclipsing variables. 
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Fig. 14.3 The variation of brightness, colour and size of 
a cepheid during its pulsation 

The eclipsing variables are binary systems in 
which the components periodically pass in front 
of each other. In these variables the light varia¬ 
tions do not correspond to any physical change 
in the stars. They have been treated in connection 
with the binary stars. In the other variables the 
brightness variations are intrinsic to the stars. In 
the pulsating variables the variations are due to 
the expansion and contraction of the outer lay¬ 
ers. These variables are giants and supergiants 
that have reached an unstable stage in their evolu¬ 
tion. The eruptive variables are usually faint stars 
ejecting mass. They are mostly members of close 
binary systems in which mass is transferred from 
one component to the other. 

In addition a few rotating variables are known, 
where the brightness variations are due to an 
uneven temperature distribution on the surface, 
starspots coming into sight when the star rotates. 
Such stars may be quite common—after all, our 
Sun is a weak rotating variable. The most promi¬ 
nent group of rotating variables are the magnetic 
A stars (e.g. the a 2 Canum Venaticorum stars). 
These stars have strong magnetic fields that may 
be giving rise to starspots. The periods of rotating 


Table 14.1 The main properties of pulsating variables 
(N, number of stars of the given type in Kukarkin’s cat¬ 
alogue, P, pulsation period in days, Am, pulsation ampli¬ 
tude in magnitudes) 


Variable 

N 

P 

Spectrum 

Am 

Classical cepheids 
(5 Cep, WVir) 

800 

1-135 

F-K I 


RR Lyrae 

6100 

<1 

A-F8 

£2 

Dwarf cepheids 
(S Scud) 

200 

0.05-7 

A-F 

£1 

/! Cephei 

90 

0.1-0.6 

B1-B3 III 

>0.3 

Mira variables 

5800 

80-1000 

M-C 

>2.5 

RV Tauri 

120 

30-150 

G-M 

£4 

Semiregular 

3400 

30-1000 

K-C 

£4.5 

Irregular 

2300 

- 

K-M 

<2 


variables range from about 1 day to 25 d, and the 
amplitudes are less than 0.1 mag. 


14.2 Pulsating Variables 

The wavelengths of the spectral lines of the pul¬ 
sating variables change along with the brightness 
variations (Table 14.1). These changes are due to 
the Doppler effect, showing that the outer layers 
of the star are indeed pulsating. The observed gas 
velocities are in the range of 40-200 km/s. 

The period of pulsation corresponds to a pro¬ 
per frequency of the star. Just like a tuning fork 
vibrates with a characteristic frequency when hit, 
a star has a fundamental frequency of vibration. 
In addition to the fundamental frequency other 
frequencies, “overtones”, are possible. The ob¬ 
served brightness variation can be understood 
as a superposition of all these modes of vibra¬ 
tion. Around 1920, the English astrophysicist Sir 
Arthur Eddington showed that the period of pul¬ 
sation P is inversely proportional to the square 
root of the mean density, 

1 

P oc-. (14.1) 

J~P 

The diameter of the star may double during the 
pulsation, but usually the changes in size are mi¬ 
nor. The main cause of the light variation is the 
periodic variation of the surface temperature. We 
have seen in Sect. 5.7 that the luminosity of a star 
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depends sensitively on its effective temperature, 
L oc Tg 4 . Thus a small change in effective temper¬ 
ature leads to a large brightness variation. 

Normally a star is in stable hydrostatic equi¬ 
librium. If its outer layers expand, the density and 
temperature decrease. The pressure then becomes 
smaller and the force of gravity compresses the 
gas again. However, unless energy can be trans¬ 
ferred to the gas motions, these oscillations will 
be damped. 

The flux of radiative energy from the stellar 
interior could provide a source of energy for the 
stellar oscillations, if it were preferentially ab¬ 
sorbed in regions of higher gas density. Usually 
this is not the case but in the ionisation zones, 
where hydrogen and helium are partially ionised, 
the opacity in fact becomes larger when the gas is 
compressed. If the ionisation zones are at a suit¬ 
able depth in the atmosphere, the energy absorbed 
during compression and released during expan¬ 
sion of an ionisation zone can drive an oscillation. 
Stars with surface temperatures of 6000-9000 K 
are liable to this instability. The corresponding 
section of the HR diagram is called the cepheid 
instability strip. 

Cepheids Among the most important pulsating 
variables are the cepheids, named after S Cephei 
(Fig. 14.3). They are population I supergiants 
(stellar populations are discussed in Sect. 18.2) 
of spectral class F-K. Their periods are 1-50 
days and their amplitudes, 0.1-2.5 magnitudes. 
The shape of the light curve is regular, showing 
a fairly rapid brightening, followed by a slower 
fall off. There is a relationship between the period 
of a cepheid and its absolute magnitude (i.e. lu¬ 
minosity), discovered in 1912 by Henrietta Leav¬ 
itt from cepheids in the Small Magellanic Cloud. 
This period-luminosity relation (Fig. 14.4) can 
be used to measure distances of stars and nearby 
galaxies. 

We have already noted that the pulsation pe¬ 
riod is related to the mean density. On the other 
hand the size of a star, and hence its mean density, 
is related to its total luminosity. Thus one can un¬ 
derstand why there should be a relation between 
the period and the luminosity of a pulsating star. 



Fig. 14.4 The period-luminosity relation for cepheids. 
The black points and squares are theoretically calculated 
values, the crosses and the straight line represent the ob¬ 
served relation. (Drawing from Novotny, E. (1973): Intro¬ 
duction to Stellar Atmospheres and Interiors (Oxford Uni¬ 
versity Press, New York) p. 359) 

The magnitudes M and periods P of classical 
cepheids are shown in Fig. 14.4. The relation be¬ 
tween M and log P is linear. However, to some 
extent, the cepheid luminosities also depend on 
colour: bluer stars are brighter. For accurate dis¬ 
tance determinations, this effect needs to be taken 
into consideration. 

WVirginis Stars In 1952 Walter Baade noted 
that there are in fact two types of cepheids: the 
classical cepheids and the W Virginis stars. Both 
types obey a period-luminosity relation, but the 
W Vir stars of a given period are 1.5 magnitudes 
fainter than the corresponding classical cepheids. 
This difference is due to the fact that the classical 
cepheids are young population I objects, whereas 
the WVir stars are old stars of population II. 
Otherwise, the two classes of variables are sim¬ 
ilar. 

Earlier, the W Vir period-luminosity relation 
had been used for both types of cepheids. Con¬ 
sequently the calculated distances to classical 
cepheids were too small. For example, the dis¬ 
tance to the Andromeda Galaxy had been based 
on classical cepheids, since only these were 
bright enough to be visible at that distance. When 
the correct period-luminosity relation was used, 
all extragalactic distances had to be doubled. Dis¬ 
tances within the Milky Way did not have to be 
changed, since their measurements were based on 
other methods. 
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Fig. 14.5 The lightcurve of a long period Mira variable 

RR Lyrae Stars The third important class of 
pulsating variables are the RR Lyrae stars. Their 
brightness variations are smaller than those of the 
cepheids, usually less than a magnitude. Their pe¬ 
riods are also shorter, less than a day. Like the 
WVir stars, the RR Lyrae stars are old popula¬ 
tion II stars. They are very common in the glob¬ 
ular star clusters and were therefore previously 
called cluster variables. 

The absolute magnitudes of the RR Lyrae stars 
are about My — 0.6 ± 0.3. They are all of roughly 
the same age and mass, and thus represent the 
same evolutionary phase, where helium is just be¬ 
ginning to burn in the core. Since the absolute 
magnitudes of the RR Lyrae variables are known, 
they can be used to determine distances to the 
globular clusters. 

Mira Variables (See Fig. 14.5.) The Mira vari¬ 
ables (named after Mira Ceti) are supergiants of 
spectral classes M, S or C, usually with emis¬ 
sion lines in their spectrum. They are losing gas 
in a steady stellar wind. Their periods are nor¬ 
mally 100-500 days, and for this reason, they 
are also sometimes called long period variables. 
The amplitude of the light variations is typically 
about 6 magnitudes in the visual region. The pe¬ 
riod of Mira itself is about 330 days and its di¬ 
ameter is about 2 au. At its brightest, Mira has 
the magnitude 2-4, but at light minimum, it may 
be down to 12. The effective temperature of the 
Mira variables is only about 2000 K. Thus 95 % 
of their radiation is in the infrared, which means 
that a very small change in temperature can cause 
a very large change in visual brightness. 

Other Pulsating Variables One additional lar¬ 
ge group of pulsating stars are the semiregular 


and irregular variables. They are supergiants, of¬ 
ten very massive young stars with unsteady pul¬ 
sations in their extended outer layers. If there 
is some periodicity in the pulsations, these vari¬ 
ables are called semiregular; otherwise they are 
irregular. An example of a semiregular variable 
is Betelgeuse (a Orionis). The pulsation mecha¬ 
nism of these stars is not well understood, since 
their outer layers are convective, and the theory 
of stellar convection is still poorly developed. 

In addition to the main types of pulsating vari¬ 
ables, there are some smaller separate classes, 
shown in Fig. 14.2. 

The dwarf cepheid and the S Scuti stars, which 
are sometimes counted as a separate type, are lo¬ 
cated below the RR Lyrae stars in the cepheid 
instability strip in the HR diagram. The dwarf 
cepheids are fainter and more rapidly varying 
than the classical cepheids. Their light curves of¬ 
ten show a beating due to interference between 
the fundamental frequency and the first overtone. 

The (i Cephei stars are located in a different 
part of the HR diagram than the other variables. 
They are hot massive stars, radiating mainly in 
the ultraviolet. The variations are rapid and of 
small amplitude. The pulsation mechanism of the 
ft Cephei stars is unknown. 

The RV Tauri stars lie between the cepheids 
and the Mira variables in the HR diagram. Their 
period depends slightly on the luminosity. There 
are some unexplained features in the light curves 
of the RV Tauri stars, e.g. the minima are alter¬ 
nately deep and shallow. 


14.3 Eruptive Variables 

In the eruptive variables there are no regular 
pulsations. Instead sudden outbursts occur in 
which material is ejected into space. Nowadays 
such stars are divided into two main categories, 
eruptive and cataclysmic variables. Brightness 
changes of eruptive variables are caused by sud¬ 
den eruptions in the chromosphere or corona, the 
contributions of which are, however, rather small 
in the stellar scale. These stars are usually sur¬ 
rounded by a gas shell or interstellar matter par¬ 
ticipating in the eruption. This group includes e.g. 
flare stars , various kinds of nebular variables. 
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Table 14.2 Main properties of eruptive variables ( N , 
number of stars of the given type in Kukarkin’s catalogue, 
Am, change in brightness in magnitudes. The velocity is 
the expansion velocity in km/s, based on the Doppler shifts 
of the spectral lines) 


Variable 

N 

Am 

Velocity 

Supernovae 

7 

>20 

4000-10,000 

Ordinary novae 

210 

7-18 

200-3500 

Recurrent novae 

£10 

600 

Nova-like stars 
(P Cygni, symbiotic) 

80 

<2 

30-100 

Dwarf novae 
(SS Cyg = U Gem, 

ZZ Cam) 

330 

2-6 

(700) 

R Coronae Borealis 

40 

1-9 

- 

Irregular 

(nebular variables, 

T Tau, RW Aur) 

1450 

£4 

(300) 

Flare stars 
(UV Ceti) 

750 

£6 

2000 


and R Coronae Borealis stars. Eruptions of the 
cataclysmic variables are due to nuclear reactions 
on the stellar surface or interior. Explosions are so 
violent that they can even destroy the whole star. 
This group includes novae and nova-like stars, 
dwarf novae and supernovae (Table 14.2). 

Flare Stars The flare or IIV Ceti stars are 
dwarf stars of spectral class M. They are young 
stars, mostly found in young star clusters and as¬ 
sociations. At irregular intervals there are flare 
outbursts on the surface of the stars similar to 
those on the Sun. The flares are related to dis¬ 
turbances in the surface magnetic fields. The en¬ 
ergy of the outbursts of the flare stars is appar¬ 
ently about the same as in solar flares, but because 
the stars are much fainter than the Sun, a flare 
can cause a brightening by up to 4-5 magnitudes. 
A flare lights up in a few seconds and then fades 
away in a few minutes (Fig. 14.6). The same star 
may flare several times in one day. The optical 
flare is accompanied by a radio outburst, like in 
the Sun. In fact, the flare stars were the first stars 
to be detected as radio sources. 

Nebular Variables In connection with bright 
and dark interstellar clouds e.g. in the constel¬ 
lations of Orion, Taurus and Auriga, there are 



Fig. 14.6 The outbursts of typical flare stars are of short 
duration 



Fig. 14.7 Light curve of a T Tauri variable 

variable stars. The T Tauri stars are the most in¬ 
teresting of them. These stars are newly formed 
or just contracting towards the main sequence. 
The brightness variations of the T Tauri stars 
are irregular (Fig. 14.7). Their spectra contain 
bright emission lines, formed in the stellar chro¬ 
mosphere, and forbidden lines, which can only be 
formed at extremely low densities. The spectral 
lines also show that matter is streaming out from 
the stars. 

Since the T Tauri stars are situated inside 
dense gas clouds, they are difficult to observe. 
However, this situation has improved with the de¬ 
velopment of radio and infrared techniques. 

Stars in the process of formation may change 
in brightness very rapidly. For example, in 1937, 
FU Orionis brightened by 6 magnitudes. This star 
is a strong source of infrared radiation, which 
shows that it is still enveloped by large quantities 
of interstellar dust and gas. A similar brighten¬ 
ing by six magnitudes was observed in 1969 in 
V1057 Cygni (Fig. 14.8). Before its brightening, 
it was an irregular T Tauri variable; since then, 
it has remained a fairly constant tenth-magnitude 
AB star. 
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Stars of the R Coronae Borealis type have “in¬ 
verse nova” light curves. Their brightness may 
drop by almost ten magnitudes and stay low for 
years, before the star brightens to its normal lu¬ 
minosity. For example, R CrB itself is of magni¬ 
tude 5.8, but may fade to 14.8 magnitudes. Fig¬ 
ure 14.9 shows its recent decline, based on ob¬ 
servations by Finnish and French amateurs. The 
R CrB stars are rich in carbon and the decline is 
produced when the carbon condenses into a cir- 
cumstellar dust shell. 

One very interesting variable is i] Cannae 
(Fig. 14.10). At present it is a six magnitude 
star surrounded by a thick, extensive envelope of 
dust and gas. In the early 19th century rj Carinae 
was the second brightest star in the sky after Sir¬ 
ius. Around the middle of the century it rapidly 
dimmed to magnitude 8, but during the 20th cen¬ 
tury it has brightened somewhat, i] Carinae is a 
so called bright blue variable. Its mass is esti¬ 
mated to be of the order of 100 solar masses. The 
circumstellar dust cloud is the brightest infrared 
source in the sky outside the solar system. The en- 



Fig. 14.8 In 1969-1970, the star V1057 Cygni bright¬ 
ened by almost 6 magnitudes 


ergy radiated by ?; Carinae is absorbed by the neb¬ 
ula and re-radiated at infrared wavelengths. The 
exact reason for the enormous brightness varia¬ 
tions is not known. It is believed that the stability 
of the star is disturbed by the radiation pressure 
of the huge energy produced by a very massive 
star. When the nucleus will collapse at the end of 
the lifespan of the star, ij Carinae will explode as 
a supernova. 

Novae One of the best known types of erup¬ 
tive variables are the novae. They are classified 
into several subtypes: ordinary novae, recurrent 
novae and nova-like variables. The dwarf novae 
(Fig. 14.1 1) are nova-like rather frequently erup¬ 
tive stars; although the lightcurves are similar to 
those of novae, the mechanism is different. 

The outbursts of all novae are rapid. Within 
a day or two the brightness rises to a maximum, 
which may be 7-16 magnitudes brighter than the 
normal luminosity. This is followed by a gradual 
decline, which may go on for months or years. 
The light curve of a typical nova is shown in 
Fig. 14.13. This light curve of Nova Cygni 1975 
has been composed from hundreds of observa¬ 
tions, mostly by amateurs. 

In recurrent novae, the brightening is some¬ 
what less than 10 magnitudes and in dwarf novae, 
2-6 magnitudes. In both types there are repeated 
outbursts. For recurrent novae the time between 
outbursts is a few decades and for the dwarf no¬ 
vae 20-600 days. The interval depends on the 
strength of the outburst: the stronger the outburst, 
the longer the time until the next one. The bright- 


Fig. 14.9 The decline of 
R Coronae Borealis in 
1977-1978; observations 
by Finnish and French 
amateur astronomers. 
(Kellomaki, Tahdet ja 
Avaruus 5/1978) 
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Fig. 14.10 In the 19th 

century, r) Carinae was one 
of the brightest stars in the 
sky; since then it has 
dimmed considerably. In 
an outburst in 1843 the star 
ejected an expanding 
nebula, which has been 
called “Homunculus”. The 
Hubble photograph has 
been printed negative, to 
show finer details in the 
bipolar outflow. 
(Photograph 

NASA/HST/University of 
Minnesota) 



Fig. 14.11 The light 
curve of the dwarf nova SS 
Cygni in the beginning of 
1966. (Drawing by Martti 
Perala is based on 
observations by Nordic 
amateurs) 


7.0 

— 

—i-1-1— 

-1-1-1-1-1-1-- 

8.0 



**• : 

9.0 


j » 

o * : 



* » 

x. . * 

10.0 

— ♦ 

t 

* * 



\ 

t 

11.0 


i - 

♦ 4 



t , 

t 

* t * 

12.0 


‘•.'ty.'v-v ■> . *• * • - 

13.0 

1 

1 1 ! 

1 1 1 1 ll _ 


8440 

8480 8520 8560 

8600 8640 8680 8720 8760 8800 


ening in magnitudes is roughly proportional to sible that ordinary novae obey the same relation- 
the logarithm of the recharging interval. It is pos- ship. However, their amplitude is so much larger 
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Fig. 14.12 The novae are 
thought to be white dwarfs 
accreting matter from 
a nearby companion star. 
At times, nuclear reactions 
burning the accreted 
hydrogen are ignited, and 
this is seen as the flare-up 
of a nova 



that the time between outbursts should be thou¬ 
sands or millions of years. 

Observations have shown all novae and dwarf 
novae to be members of close binary systems. 
One component of the system is a normal star and 
the other is a white dwarf surrounded by a gas 
ring. (The evolution of close binary systems was 
considered in Sect. 12.6, where it was seen how 
this kind of system might have been formed.) 
The normal star fills its Roche surface, and ma¬ 
terial from it streams over to the white dwarf. 
When enough mass has collected on the surface 
of the white dwarf, the hydrogen is explosively 
ignited and the outer shell is ejected. The bright¬ 
ness of the star grows rapidly. As the ejected 
shell expands, the temperature of the star drops 
and the luminosity gradually decreases. How¬ 
ever, the outburst does not stop the mass transfer 
from the companion star, and gradually the white 
dwarf accretes new material for the next explo¬ 
sion (Fig. 14.12). 

The emission and absorption lines from the ex¬ 
panding gas shell can be observed in the spec¬ 
trum of a nova. The Doppler shifts correspond 
to an expansion velocity of about 1000 km/s. As 
the gas shell disperses, its spectrum becomes that 
of a typical diffuse emission nebula. The expand¬ 
ing shell around a nova can also sometimes be 
directly seen in photographs. 

In dwarf novae the energy does not come from 
nuclear reactions but mainly from the potential 
energy of the matter falling into the white dwarf. 
An eruption occurs when the density of the accre¬ 


tion disk formed by the matter flowing from the 
companion star exceeds a certain critical limit, 
becomes unstable and is strongly heated. 

A considerable fraction of the novae in our 
Galaxy are hidden by interstellar clouds and their 
number is therefore difficult to estimate. In the 
Andromeda Galaxy, observations indicate 25- 
30 nova explosions per year. The number of 
dwarf novae is much larger. In addition there 
are nova-like variables, which share many of the 
properties of novae, such as emission lines from 
circumstellar gas and rapid brightness variations. 
These variables, some of which are called symbi¬ 
otic stars , are close binaries with mass transfer. 
Gas streaming from the primary hits a gas disk 
around the secondary in a hot spot, but there are 
no nova outbursts. 

The supersoft stars (SSS) are a somewhat pe¬ 
culiar subclass of cataclysmic variables. Their X- 
ray radiation is much stronger and much softer, 
i.e. has longer wavelength, than that of the ordi¬ 
nary cataclysmic variables. In the SSS objects the 
component losing mass is more massive than the 
compact star (white dwarf), in contrast with or¬ 
dinary interacting binaries. Therefore the mass 
transfer is an unstable self sustained process 
transferring more material to the surface of the 
white dwarf than in the normal cataclysmic vari¬ 
ables. On the surface of the white dwarf a contin¬ 
uous fusion reaction resembling nova eruptions 
is producing a lot of soft X-ray radiation. Due to 
their unstable character the SSS objects are short 
lived and therefore relatively rare. 
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Fig. 14.13 In 1975 a new 
variable, Nova Cygni or 
V1500Cygni, was 
discovered in Cygnus. In 
the upper photograph the 
nova is at its brightest 
(about 2 magnitudes), and 
in the lower photograph it 
has faded to magnitude 15. 
(Photographs Lick 
Observatory) 



14.4 Supernovae 

Explosions of stars as supernovae are among 
the most energetic phenomena of the universe. 
Within a couple of weeks the star becomes so 
bright that its luminosity corresponds to over a 
billion Suns or a whole small galaxy. Thus they 
can be observed even at cosmological distances, 
and they are useful standard candles for measur¬ 


ing the dimensions of the universe and cosmo¬ 
logical quantities. Supernovae are also important 
sources of heavy elements. Most elements heav¬ 
ier than iron have originated in supernova explo¬ 
sions. 

The maximum is followed by a slow decay, 
and the nearest supernovae can be observed still 
years after the explosion. In the explosion a gas 
shell expanding with a velocity of thousands of 
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Fig. 14.14 The type of a supernova can be deduced from 
different spectral lines in its spectrum and the shape of its 
lightcurve 


kilometres per second is ejected. Over 200 of 
such supernova remnants have been discovered in 
the Milky Way. Their ages vary from a few hun¬ 
dred to tens of thousands of years. 

At least six supernova explosions have been 
observed in the Milky Way. Best known are the 
“guest star” seen in China in 1054 (whose rem¬ 
nant is the Crab nebula), Tycho Brahe’s super¬ 
nova in 1572 and Kepler’s supernova in 1604. On 
the basis of observations of other Sb-Sc-type spi¬ 
ral galaxies, the interval between supernova ex¬ 
plosions in the Milky Way is predicted to be about 
50 years. Some will be hidden by obscuring ma¬ 
terial, especially near the centre of the galaxy. In 
particularly dusty starburst galaxies almost all su¬ 
pernovae remain undetected due to the extinction 
of the dust. Still the 400 years’ interval since the 
last observed supernova in the Milky Way is un¬ 
usually long. 

Supernovae in other galaxies are detected 
quite frequently. For example, both in 2006 and 
2007 over 500 new supernovae were found. In 
the near future the number will increase rapidly 
thanks to new all sky surveys. 

Earlier supernovae were divided just to two 
main categories, I and II, based on whether their 
spectra showed evidence of hydrogen. Later these 
main classes have been divided into several sub- 



Fig. 14.15 Typical shapes of the lightcurves of different 
supernovae 

classes depending on other spectral features and 
their lightcurves (Figs. 14.14, 14.15 and 14.16). 

Type la supernovae are recognised by the sil¬ 
icon absorption line (615 nm), which is strong 
in the spectra of young novae. It is thought that 
these supernovae originate in binaries, at least a 
billion years old, consisting of a white dwarf and 
its companion. In such a close binary material can 
flow from the companion to the white dwarf un¬ 
til it mass exceeds the limit (about 1.4Mq). Then 
the pressure of the degenerate of the electron gas 
inside the white dwarf cannot overcome the grav¬ 
itation and star will collapse. Increase of the den¬ 
sity and temperature will ignite an explosive fu¬ 
sion reaction that can destroy the white dwarf. 
The kinetic energy released in the process is of 
the order of 10 44 J causing part of the gas to ex¬ 
pand even at a velocity of 0.1c. The origin of the 
radiation energy (around 10 42 J) is the fission of 
the radioactive nickel isotope 56 created in the 
explosion (typically about 0.5 Mq) into radioac¬ 
tive cobalt and further into stable iron. The more 
nickel is produced in the explosion the brighter 
the supernova shines at its maximum; the abso¬ 
lute magnitude will reach —19—20. The shape 
of the lightcurve depends on the brightness of the 
supernova: the brighter the supernova the broader 
the top of the lightcurve. Therefore the maximum 
brightness of a typer la supernova can be de¬ 
termined precisely from its lightcurve. This way 
they can be used as standard candles to determine 
dimensions of the universe and cosmological pa¬ 
rameters (Chap. 20). 

In Chap. 12 we discussed the final stages of 
stellar evolution and how the collapse of a mas¬ 
sive star will lead to an explosion. It is believed 
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Fig. 14.16 Spectra of 
different types of 
supemovae at the 
maximum, three weeks 
after the maximum and 
after one year. (Turatto: 
Supernovae and 
Gamma-Ray Bursters, ed. 
K. Weiler, Lecture Notes in 
Physics, vol. 598, 
pp. 21-36) 



that all supemovae except type la are caused by 
the collapse of massive and short lived stars. At 
the end of its lifespan such a star has an iron 
core that will collapse under its own gravity after 
exceeding the Chandrasekhar limit. In the stellar 
core the matter will reach the density of atomic 
nuclei and the outer layers bounce back send¬ 
ing an outbound shock wave. The process cre¬ 
ates a huge number of neutrinos. Since they in¬ 
teract only weakly with the outer layers of the 
star they rush to the surrounding space. The re¬ 
maining core may become a neutron star or, in 
the case of a very massive star, a black hole. Most 
of the released energy (about 99 %) escapes with 
the neutrinos. The outer layers of the star receive 
a kinetic energy of about 10 44 J, part of which 
may be released as radiation when the expanding 
matter hits the material surrounding the star and 
later the interstellar matter. Typically the super¬ 
nova itself emits radiation about 10 42 J within a 
few months after the explosion, reaching an ab¬ 
solute magnitude between —16 and —20 at the 
maximum. Later the brightness is sustained by 
the fission of the radioactive nickel 56, born in the 
explosion, to other elements just as in the case of 
the type la supernovae. 

If the spectrum of a supernova does not con¬ 
tain strong hydrogen or silicon lines its type is 


either lb or Ic. The type lb supernovae are recog¬ 
nised by strong helium lines that are not present 
in the spectra of type Ic. These supernovae orig¬ 
inate in stars that don’t have any more hydrogen 
in their outer layer. The stars exploding as type Ic 
supernovae have also lost their outer layer of he¬ 
lium. Such stars that have lost their outer layers 
are called Wolf-Rayet stars. 

All type II supernovae have hydrogen lines in 
their spectra. They are divided into main types 
II-P and II-L based on the flat (plateau) and lin¬ 
ear shape of the lightcurve. Recently several stars 
that later exploded as supernovae have been iden¬ 
tified in high resolution pictures taken e.g. by the 
Hubble space telescope and the VLT telescope 
before the explosion (Fig. 14.17). These obser¬ 
vations have shown that the more common type 
II-P supernovae originate in red supergiant stars 
as predicted by theories of stellar evolution. Ini¬ 
tially the masses of such stars have been at least 8 
solar masses. Smaller stars are not believed to ex¬ 
plode in a collapse. They end up as white dwarfs 
after more quiescent evolution. 

Type Iln supernovae are recognised by their 
hydrogen emission lines that are much narrower 
than in the spectra of other supernovae. The lines 
originate in the matter around the star. The spec¬ 
tra of young type Iln supernovae have hydrogen 
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lines, which, however, disappear in a few weeks, 
and afterwards the spectrum resembles the type 
lb spectra. It is assumed that these supernovae 
were originally massive stars that still had a small 
amount of hydrogen in their outer layer before the 
explosion. Thus they are intermedia between the 
types II and lb. The observed properties and types 
of supernovae depend on the properties of their 
progenitors, and hence individual supernovae can 
of intermediate types. Hence it makes more sense 


to classify supernovae according to the explosion 
mechanism simply to two different types, ther¬ 
monuclear and core collapse supernovae. About 
30 % of the supernovae found in nearby galaxies 
are thermonuclear and the remaining 70 % core 
collapse supernovae. 

On February 23, 1987 the first burst of light 
from a supernova in the Large Magellanic Cloud, 
the small companion galaxy of the Milky Way, 
reached the Earth (Fig. 14.18). This supernova, 


Fig. 14.17 Left: The 
supernova 2003gd of type 
II-P after the explosion. 
Right: The same region 
before the explosion. The 
exploded red supergiant 
(progenitor) was located in 
the image taken before the 
explosion. Both images 
were taken with the Hubble 
space telescope. (Smartt et 
al., 2004, Science, 
vol. 303, 5657, 499) 




Fig. 14.18 Supernova 1987A in the Large Magellanic Cloud before and after the explosion. (Photographs ESO) 
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SN 1987A, was of type II, and was the bright¬ 
est supernova for 383 years. After its first detec¬ 
tion, SN 1987A was studied in great detail by all 
available means. Although the general ideas of 
Sects. 11.4 and 11.5 on the final stages of stellar 
evolution have been confirmed, there are compli¬ 
cations. Thus e.g. the progenitor star was a blue 
rather than a red giant as expected, perhaps be¬ 
cause of the lower abundance of heavy elements 
in the Large Magellanic Cloud compared to that 
in the Milky Way. The collapse of its core re¬ 
leased a vast amount of energy as a pulse of neu¬ 
trinos, which was detected in Japan and the USA. 
The amount of energy released indicates that the 
remnant is a neutron star. 


14.5 Examples 


This gives 


Rn 


lg —= —0.2 Am — 21g : 


Rmn 


5000 

-0.4 -21g-= -0.24, 

5 6000 


whence 


Rn 


Rn 


= 0.57. 


14.6 Exercises 

Exercise 14.1 The absolute visual magnitude of 
RRLyrae variables is 0.6 ± 0.3. What is the rela¬ 
tive error of distances due to the deviation in the 
magnitude? 


Example 14.1 The observed period of a cepheid 
is 20 days and its mean apparent magnitude m — 
20. From Fig. 14.4, its absolute magnitude is 
M ~ —5. According to (4.12), the distance of the 
cepheid is 

r = 10 x 10 {m_jW)/5 = 10 x lo (20+5) / 5 
= 10 6 pc = 1 Mpc. 


Example 14.2 The brightness of a cepheid 
varies 2 mag. If the effective temperature is 
6000 K at the maximum and 5000 K at the mini¬ 
mum, how much does the radius change? 

The luminosity varies between 


2 4 

Cmax = 47rR^ lax crT , max , 
Lmin = 4jrR“ un crl^ in . 


In magnitudes the difference is 


A m = —2.5 lg - 


= —2.51g 


4j r4n<in 

4 7i crT 4 

-tji J v max '- / ± m ax 


, ^min r. , -^min 

= ~51g--!0 lg ——. 

^max -*max 


Exercise 14.2 The bolometric magnitude of 
a long period variable varies by one magnitude. 
The effective temperature at the maximum is 
4500 K. 

(a) What is the temperature at the minimum, if 
the variation is due to temperature change 
only? 

(b) If the temperature remains constant, what is 
the relative variation in the radius? 

Exercise 14.3 In 1983 the radius of the Crab 
nebula was about 3'. It is expanding 0.21" a year. 
Radial velocities of 1300 kms -1 with respect to 
the central star have been observed in the nebula. 

(a) What is the distance of the nebula, assuming 
its expansion is symmetric? 

(b) A supernova explosion has been observed in 
the direction of the nebula. Estimate, how 
long time ago? 

(c) What was the apparent magnitude of the su¬ 
pernova, if the absolute magnitude was a typ¬ 
ical -18? 













Compact Stars 



In astrophysics those stars in which the density 
of matter is much larger than in ordinary stars are 
known as compact objects. These include white 
dwarfs, neutron stars, and black holes. In addi¬ 
tion to a very high density, the compact objects 
are characterised by the fact that nuclear reactions 
have completely ceased in their interiors. Conse¬ 
quently they cannot support themselves against 
gravity by thermal gas pressure. In the white 
dwarfs and neutron stars, gravity is resisted by the 
pressure of a degenerate gas. In the black holes 
the force of gravity is completely dominant and 
compresses the stellar material to infinite density. 

Compact stars in binary systems give rise to a 
variety of striking new phenomena. If the com¬ 
panion star is losing mass by a stellar wind or 
a Roche lobe overflow, the gas that is shed may 
be accreted by the compact object. This will re¬ 
lease gravitational energy that can be observable 
in the form of X-ray emission and strong and 
rapid brightness variations. 


15.1 White Dwarfs 

As was mentioned in Sect. 11.2, in ordinary stars 
the pressure of the gas obeys the equation of state 
of an ideal gas. In stellar interiors the gas is fully 
ionised, i.e. it is plasma consisting of ions and 
free electrons. The partial pressures of the ions 
and electrons together with the radiation pressure 
important in hot stars comprise the total pres¬ 
sure balancing gravitation. When the star runs 
out of its nuclear fuel, the density in the interior 
increases, but the temperature does not change 


much. The electrons become degenerate, and the 
pressure is mainly due to the pressure of the de¬ 
generate electron gas, the pressure due to the ions 
and radiation being negligible. The star becomes 
a white dwarf. 

As will be explained in Box 15.1 the radius of 
a degenerate star is inversely proportional to the 
cubic root of the mass. Unlike in a normal star the 
radius decreases as the mass increases. 

The first white dwarf to be discovered was Sir¬ 
ius B, the companion of Sirius (Fig. 15.1). Its ex¬ 
ceptional nature was realised in 1915, when it 
was discovered that its effective temperature was 
very high. Since it is faint, this meant that its ra¬ 
dius had to be very small, slightly smaller than 
that of the Earth. The mass of Sirius B was known 
to be about equal to that of the Sun, so its density 
had to be extremely large. 

The high density of Sirius B was confirmed in 
1925, when the gravitational redshift of its spec¬ 
tral lines was measured. This measurement also 
provided early observational support to Einstein’s 
general theory of relativity. 

White dwarfs occur both as single stars and 
in binary systems. Their spectral lines are broad¬ 
ened by the strong gravitational field at the sur¬ 
face. In some white dwarfs the spectral lines are 
further broadened by rapid rotation. Strong mag¬ 
netic fields have also been observed. 

White dwarfs have no internal sources of en¬ 
ergy, but further gravitational contraction is pre¬ 
vented by the pressure of the degenerate electron 
gas. Radiating away the remaining heat, white 
dwarfs will slowly cool, changing in colour from 
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Fig. 15.1 Two views of the best-known white dwarf Sir¬ 
ius B. the small companion to Sirius. On the left, a picture 
in visible light by the Hubble Space Telescope. Sirius B 
is the tiny white dot on lower left from the overexposed 
image of Sirius. On the right, an X-ray picture of the pair 

white to red and finally to black. The cooling time 
is comparable to the age of the Universe, and even 
the oldest white dwarfs should still be observable. 
Looking for the faintest white dwarfs has been 
used as a way to set a lower limit on the age of 
the Universe. 

Cataclysmic Variables When a white dwarf is 
a member of a close binary system, it can ac¬ 
crete mass from its companion star. The most in¬ 
teresting case is where a main sequence star is 
filling its Roche lobe, the largest volume it can 
have without spilling over to the white dwarf. As 
the secondary evolves, it expands and begins to 
lose mass, which is eventually accreted by the 
primary. Binary stars of this kind are known as 
cataclysmic variables. 

The present definition of the class of cata¬ 
clysmic variables has gradually evolved, and in 
consequence many types of systems that were 
earlier viewed as separate are now collected un¬ 
der this heading. In principle, even type la super¬ 
novae should be included. The classical novae, 
whose eruptions are caused by the sudden igni¬ 
tion of hydrogen that has collected on the sur¬ 
face of the white dwarf, have been described in 
Sect. 14.3. In the eruptions most of the accreted 


taken by the Chandra X-ray observatory. Sirius B is now 
the brighter source, and Sirius is weaker, because its sur¬ 
face is much cooler than the surface of the white dwarf. 
(Photos NASA / HST and Chandra) 


gas is expelled in a shell, but if the mass transfer 
continues in the system further eruptions may oc¬ 
cur, giving rise to recurrent novae. Finally, cata¬ 
clysmic variables without eruptions, for example 
pre-novae or post-novae, are classified as nova¬ 
like variables. 

The dwarf novae, also described in Sect. 14.3, 
are produced by a quite different mechanism. In 
their case the outbursts are not caused by ther¬ 
monuclear reactions, but by instabilities in the 
accretion flow around the white dwarf. Although 
the details of the outburst mechanism are still not 
completely clear, the basic picture is that the disk 
has two possible states, a hot and a cool one, 
available. Under some conditions the disk cannot 
remain permanently in either of these states, and 
has to jump repeatedly between the hot outburst 
state and the cool quiescent state. 

A special type of nova-like variables are the 
magnetic cataclysmic variables. In the polars the 
magnetic field is so strong that the accreted gas 
cannot settle into an accretion disk. Instead it is 
forced to follow the magnetic field lines, form¬ 
ing an accretion column. As the gas hits the sur¬ 
face of the white dwarf it is strongly heated giving 
rise to bright X-ray emission, which is a charac¬ 
teristic feature of polars. Systems with a slightly 
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Fig. 15.2 The structure of 
a neutron star. The crust is 
rigid solid material and the 
mantle a freely streaming 
superfluid 



Surface 
10 7 kg/m 3 


weaker magnetic field are called intermediate po- 
lars. These systems exhibit both X-ray emission 
and variations due to an accretion disk. 


15.2 Neutron Stars 

If the mass of a star is large enough, the density of 
matter may grow even larger than in normal white 
dwarfs. The equation of state of a classical degen¬ 
erate electron gas then has to be replaced with the 
corresponding relativistic formula. In this case 
decreasing the radius of the star no longer helps 
in resisting the gravitational attraction. Equilib¬ 
rium is possible only for one particular value of 
the mass, the Chandrasekhar mass Mch, already 
introduced in Sect. 12.5. The value of Mch is 
about 1.4 Mq, which is thus the upper limit to the 
mass of a white dwarf. If the mass of the star is 
larger than Mch, gravity overwhelms the pressure 
and the star will rapidly contract towards higher 
densities. The final stable state reached after this 
collapse will be a neutron star (Fig. 15.2). On the 
other hand, if the mass is smaller than Mch. the 
pressure dominates. The star will then expand un¬ 
til the density is small enough to allow an equilib¬ 
rium state with a less relativistic equation of state. 

When a massive star reaches the end of its evo¬ 
lution and explodes as a supernova, the simulta¬ 
neous collapse of its core will not necessarily stop 
at the density of a white dwarf. If the mass of the 
collapsing core is larger than the Chandrasekhar 
mass (>1.4 Mq), the collapse continues to a neu¬ 
tron star. 

An important particle reaction during the fi¬ 
nal stages of stellar evolution is the URCA pro¬ 
cess, which was put forward by Schonberg and 
Gamow in the 1940’s and which produces a large 


neutrino emission without otherwise affecting the 
composition of matter. (The URCA process was 
invented in Rio de Janeiro and named after a local 
casino. Apparently money disappeared at URCA 
just as energy disappeared from stellar interiors in 
the form of neutrinos. It is claimed that the casino 
was closed by the authorities when this similarity 
became known.) The URCA process consists of 
the reactions 

(Z,A) + e-—► (Z- 1, A) + v e , 

(Z — 1, A) (Z, A) + e“ + v e , 

where Z is the number of protons in a nucleus; 
A the mass number; e _ an electron; and v e and 
y e the electron neutrino and antineutrino. When 
the electron gas is degenerate, the latter reaction 
is suppressed by the Pauli exclusion principle. In 
consequence the protons in the nuclei are trans¬ 
formed into neutrons. As the number of neutrons 
in the nuclei grows, their binding energies de¬ 
crease. At densities of about 4 x 10 14 kg/m 3 the 
neutrons begin to leak out of the nucleus, and at 
10 17 kg/m 3 the nuclei disappear altogether. Mat¬ 
ter then consists of a neutron “porridge”, mixed 
with about 0.5 % electrons and protons. 

Neutron stars are supported against gravity by 
the pressure of the degenerate neutron gas, just 
as white dwarfs are supported by electron pres¬ 
sure. The equation of state is the same, except 
that the electron mass is replaced by the neu¬ 
tron mass, and that the mean molecular weight is 
defined with respect to the number of free neu¬ 
trons. Since the gas consists almost entirely of 
neutrons, the mean molecular weight is approx¬ 
imately one. 

The typical diameters of neutron stars are 
about 10 km. Unlike ordinary stars they have 
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Fig. 15.3 A rotating neutron star is surrounded by a 
strong magnetic field which drags electrons from the sur¬ 
face and accelerates them to relativistic speeds over the 
magnetic poles. When the electrons are accelerated along 
the magnetic field lines, they radiate so called curvature 
radiation in a narrow beam. Since the magnetic axis is mis¬ 
aligned with the rotation axis, the beams sweep around the 
sky like in a lighthouse. (Lorimer-Kramer 2005, Hand¬ 
book of Pulsar Astronomy, Cambridge University Press, 
p. 55) 

a well-defined solid surface. The atmosphere 
above it is a few centimetres thick. The upper 
crust is a metallic solid with the density grow¬ 
ing rapidly inwards. Most of the star is a neutron 
superfluid, and in the centre, where the density 
exceeds 10 18 kg/m 3 , there may be a solid nu¬ 
cleus of heavier particles (hyperons), or of quark 
matter, where the quarks that normally constitute 
neutrons have become unconhned. 

A neutron star formed in the explosion and 
collapse of a supernova will initially rotate ra¬ 
pidly, because its angular momentum is un¬ 
changed while its radius is much smaller than 
before. In a few hours the star will settle in 
a flattened equilibrium, rotating several hundred 
times per second. The initial magnetic field of the 
neutron star will also be compressed in the col¬ 
lapse, so that there will be a strong held coupling 
the star to the surrounding material. The angu¬ 
lar momentum of the neutron star is steadily de- 



Fig. 15.4 Consecutive radio pulses at 408 MHz from 
two pulsars. To the left PSR 1642-03 and to the right 
PSR 1133+16. Observations made at Jodrell Bank. (Pic¬ 
ture from Smith, F.G. (1977): Pulsars (Cambridge Univer¬ 
sity Press, Cambridge) pp. 93, 95) 

creased by the emission of electromagnetic radia¬ 
tion, neutrinos, cosmic ray particles and possibly 
gravitational radiation. Thus the angular veloc¬ 
ity decreases. The rotation can also break the star 
into several separate objects. They will eventu¬ 
ally recombine when the energy of the system is 
reduced. In some cases the stars can remain sep¬ 
arated, resulting e.g. in a binary neutron star. 

The theory of neutron stars was developed in 
the 1930’s, but the first observations were not 
made until the 1960’s. At that time the pulsars , 
a new type of rapidly pulsating radio sources, 
were discovered and identified as neutron stars. 
In the 1970’s neutron stars were also seen as X- 
ray pulsars, X-ray bursters and magnetars. 

Pulsars The pulsars were discovered in 1967, 
when Anthony Hewish and Jocelyn Bell in Cam¬ 
bridge, England, detected sharp, regular radio 
pulses coming from the sky. Since then about 
1500 pulsars have been discovered (Fig. 15.4). 
Their periods range from 0.0016 s (for the pul¬ 
sar 1937 + 214) up to 20 minutes. 

In addition to the steady slowing down of the 
rotation, sometimes small sudden jumps in the 
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Fig. 15.5 A time- 
sequence of the pulsation 
of the Crab pulsar in visible 
light. The pictures were 
taken once about every 
millisecond; the period of 
the pulsar is about 33 
milliseconds. (Photos N.A. 
Sharp/NOAO/AURA/NSF) 



period are observed. These might be a sign of 
rapid mass movements in the neutron star crust 
(“starquakes”) or in its surroundings. 

The origin of the radio pulses can be under¬ 
stood if the magnetic field is tilted at an angle 
of 45°-90° with respect to the rotation axis. The 
field is so strong that it drags electrons from the 
surface and accelerates them to relativistic speeds 
over the magnetic poles. When the electrons are 
accelerated along the magnetic field lines, they 
radiate so called curvature radiation which is re¬ 
lated to synchrotron radiation (Fig. 15.3). In the 
direction of the magnetic poles two thin beams 
of radio radiation are emitted. The beams sweep 
around the sky, and if the Earth happens to be in 
the path of the beam a pulsar is seen. 

The best-known pulsar is located in the Crab 
nebula (Figs. 15.5 and 15.7). This small neb¬ 
ula in the constellation Taurus was noted by the 
French astronomer Charles Messier in the middle 
of the 18th century and became the first object in 
the Messier catalogue. Ml. The Crab nebula was 
found to be a strong radio source in 1948 and an 
X-ray source in 1964. The pulsar was discovered 
in 1968. In the following year it was observed op¬ 
tically and was also found to be an X-ray emitter. 

Neutron stars are difficult to study optically, 
since their luminosity in the visible region is very 


small (typically about 10 -6 Lq). For instance the 
Vela pulsar has been observed at a visual magni¬ 
tude of about 25. In the radio region, it is a very 
strong pulsating source. 

A few pulsars have been discovered in binary 
systems; the first one, PSR 1913+16, in 1974. 
In 1993 Joseph Taylor and Russell Hulse were 
awarded the Nobel prize for the detection and 
studies of this pulsar. The pulsar orbits about 
a companion, presumably another neutron star, 
with the orbital eccentricity 0.6 and the period 
8 hours. The observed period of the pulses is al¬ 
tered by the Doppler effect, and this allows one to 
determine the velocity curve of the pulsar. These 
observations can be made very accurately, and it 
has therefore been possible to follow the changes 
in the orbital elements of the system over a pe¬ 
riod of several years. For example, the periastron 
direction of the binary pulsar has been found to 
rotate about 4° per year. This phenomenon can 
be explained by means of the general theory of 
relativity; in the solar system, the corresponding 
rotation (the minor fraction of the rotation not 
explained by the Newtonian mechanics) of the 
perihelion of Mercury is 43 arc seconds per cen¬ 
tury. 

The binary pulsar PSR 1913+16 has also pro¬ 
vided the first strong evidence for the existence of 
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gravitational waves. During the time of observa¬ 
tion the orbital period of the system has steadily 
decreased. This shows that the system is losing 
orbital energy at a rate that agrees exactly with 
that predicted by the general theory of relativity. 
The lost energy is radiated as gravitational waves. 

Magnetars The energy emitted by common 
pulsars has its origin in the slowing down of their 
rotation. In some neutron stars, the magnetars, 
the magnetic field is so strong that the energy re¬ 
leased in the decay of the field is the main source 
of energy. Whereas in ordinary pulsars the mag¬ 
netic field is typically 10 9 T, in magnetars a typi¬ 
cal value may be 10 9 —10 11 T. 

Magnetars were first invoked as an explana¬ 
tion of the soft gamma repeaters (SGR), X-ray 
stars that irregularly emit bright, short (0.1 s) re¬ 
peating flashes of low-energy gamma rays. Later 
a second class of mysterious objects, the anoma¬ 
lous X-ray pulsars (AXP), were identified as 
magnetars. AXP are slowly rotating pulsars, with 
a rotation period of 6 to 12 seconds. Despite this 
they are bright X-ray sources, which can be un¬ 
derstood if their energy is of magnetic origin. 

It is thought that magnetars are the remnants 
of stars that were more massive and rapidly ro¬ 
tating than those giving rise to ordinary pulsars, 
although the details are still subject to debate. 
A magnetar first appears as a SGR. During this 
phase, which only lasts about 10,000 years, the 
very strong magnetic field is slowing down the 
rate of rotation. At the same time the field is 
drifting with respect to the neutron star crust. 
This causes shifts in the crust structure, leading 
to powerful magnetic flares and the observed out¬ 
bursts. After about 10,000 years the rotation has 
slowed down so much that the outbursts cease, 
leaving the neutron star observable as an AXP. 

Gamma Ray Bursts For a long time the gam¬ 
ma ray bursts (GRB), very short and sharp 
gamma ray pulses first discovered in 1973, re¬ 
mained a mystery. Unlike the much less common 
SGR, the GRB never recurred, and they had no 
optical or X-ray counterparts. A first major ad¬ 
vance was made when satellite observations with 
the Compton Gamma Ray Observatory showed 


HST: Slo & Sib 



RA (J2000) 

Fig. 15.6 The location of the peculiar type Ibc supernova 
SN 1998bw at redshift z = 0.0085 is in the circle to the 
lower left. This was also the position of the faint gam¬ 
ma-ray burst GRB 980425, the first GRB to be connected 
with a supernova. The circle on the upper right marks an 
ultraluminous X-ray source. (C. Kouveliotou et al. 2004, 
ApJ 608, 872, Fig. 1) 

that the gamma ray bursts are almost uniformly 
distributed in the sky, unlike the known neutron 
stars. 

The nature of the gamma ray bursts is now 
becoming clear thanks to dedicated observing 
programmes that have used burst detections by 
gamma and X-ray satellites such as Beppo-SAX 
and, in particular, Swift rapidly to look for after¬ 
glows of the GRB at optical wavelengths. The de¬ 
tection of these afterglows has made it possible to 
determine distances to the bursts and their loca¬ 
tion in their host galaxies (see Fig. 15.6). 

It has become clear that there are at least two 
kinds of bursts, with the self-descriptive names 
long soft bursts, and short hard bursts. The long 
soft gamma ray bursts, lasting longer than 2 sec¬ 
onds, have now been convincingly shown to be 
produced in the explosions of massive stars at the 
end of their life, specifically supernovae of types 
lb and Ic (Sect. 14.3). Only a small fraction of 
all type Ibc supernovae give rise to a GRB. The 
explosions that produce GRB have been called 
hypernovae, and are among the brightest objects 
in the Universe. A gamma ray burst observed in 
late 2005 took place when the Universe was only 
900 million years old, making it one of the most 
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Fig. 15.7 Some pulsars shine brightly in gamma-rays. In 
the center the Crab pulsar and on the upper left the gamma 
source Geminga, which was identified in 1992 to be the 
nearest pulsar with a distance of about 100 pc from the 
Sun. (Photo by Compton Gamma Ray Observatory) 

distant objects ever observed. The conditions re¬ 
quired for hypernova explosions are still not cer¬ 
tain. 

The nature of the systems giving rise to short 
gamma ray bursts, lasting less than 2 seconds, 
have been more difficult to ascertain. The most 
popular theory has been that they are produced in 
compact binary systems consisting of two neu¬ 
tron stars or a neutron star and a black hole. 
These systems lose energy by gravitational radi¬ 
ation, and eventually the two components should 
merge, producing a burst of gamma radiation. 
This theory has now received strong support 
when the afterglow of a few short bursts has been 
detected in the outer parts of their host galaxies. 
Since the stars in these regions are all old and 
no longer give rise to core-collapse supernovae, 
the neutron star merger hypothesis appears most 
likely. However, it is still also possible that some 
of the short bursts are exceptionally bright mag- 
netar flares. 


Box 15.1 (The Radius of White Dwarfs and 
Neutron Stars) The mass of a white dwarf or 
a neutron star determines its radius. This fol¬ 
lows from the equation of hydrostatic equilib¬ 
rium and from the pressure-density relation for 


a degenerate gas. Using the hydrostatic equilib¬ 
rium equation (11.1) 

dP _ GM r p 
dr r 2 

one can estimate the average pressure P: 

dP _ P M x M/P 3 _ M 2 
d7 ~ P ^ P 2 ~~ P 3 "' 

Here we have used p oc M/P 3 . Thus the pres¬ 
sure obeys 

P oc M 2 /P 4 . (1) 

In the nonrelativistic case, the pressure of a de¬ 
generate electron gas is given by (1 1. 16): 

P (h 2 /m e )(fj, e m H r 5/3 p 5/3 


and hence 


P oc 


p 5 / 3 

5/3 

7«eP-e 


By combining (1) and (2) we obtain 


( 2 ) 


M 2 M 5 / 3 


or 

P OC -j-r oc M -1 / 3 . 

Thus the smaller the radius of a white dwarf is, 
the larger its mass will be. If the density be¬ 
comes so large that the relativistic equation of 
state (11.17) has to be used, the expression for 
the pressure is 


P OC p 4 / 3 oc 


M 4 / 3 

P 4 


As the star contracts, the pressure grows at the 
same rate as demanded by the condition for hy¬ 
drostatic support (1). Once contraction has be¬ 
gun, it can only stop when the state of mat¬ 
ter changes: the electrons and protons com¬ 
bine into neutrons. Only a star that is massive 
enough can give rise to a relativistic degenerate 
pressure. 

The neutrons are fermions, just like the elec¬ 
trons. They obey the Pauli exclusion principle, 
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and the degenerate neutron gas pressure is ob¬ 
tained from an expression analogous to (2): 


Pi i oc 


pV 3 

W n Mn 


where m n is the neutron mass and //„, the 
molecular weight per free neutron. Corre¬ 
spondingly, the radius of a neutron star is given 
by 


Rns OC 


1 

M 1 / 3 m n /tt n ^ 3 


If a white dwarf consists purely of helium, 
/r e = 2; for a neutron star, ~ I. If a white 
dwarf and a neutron star have the same mass, 
the ratio of their radii is 


Rv/d 

Run 


( M ^ 

\Mv/dJ \Me/ m e 
/IX 5/3 

lx(-j X 1840^600. 


Thus the radius of a neutron star is about 1/600 
of that of a white dwarf. Typically K rls is about 
10 km. 


15.3 Black Holes 

If the mass of a star exceeds Mov (Sect. 12.5), 
and if it does not lose mass during its evolu¬ 
tion it can no longer reach any stable final state. 
The force of gravity will dominate over all other 
forces, and the star will collapse to a black hole. 
A black hole is black because not even light can 
escape from it. Already at the end of the 18th 
century Laplace showed that a sufficiently mas¬ 
sive body would prevent the escape of light from 
its surface. According to classical mechanics, the 
escape velocity from a body of radius R and 
mass M is 

\2GM 
Ve ~ \ R 

This is greater than the speed of light, if the radius 
is smaller than the critical radius 

Rs — 2GM/c 2 . (15.1) 


Axis of rotation 



Fig. 15.8 A black hole is surrounded by a spherical event 
horizon. In addition to this a rotating black hole is sur¬ 
rounded by a flattened surface inside which no matter can 
remain stationary. This region is called the ergosphere 


The same value for the critical radius, the Schwa- 
rzschild radius, is obtained from the general the¬ 
ory of relativity. For example, for the Sun, Rs 
is about 3 km; however, the Sun’s mass is so 
small that it cannot become a black hole by nor¬ 
mal stellar evolution. Because the mass of a black 
hole formed by stellar collapse has to be larger 
than Mov the radius of the smallest black holes 
formed in this way is about 5-10 km. 

The properties of black holes have to be stud¬ 
ied on the basis of the general theory of relativity, 
which is beyond the scope of this book. Thus only 
some basic properties are discussed qualitatively. 

An event horizon is a surface through which 
no information can be sent out, even in principle. 
A black hole is surrounded by an event horizon at 
the Schwarzschild radius (Fig. 15.8). In the the¬ 
ory of relativity each observer carries with him 
his own local measure of time. If two observers 
are at rest with respect to each other at the same 
point their clocks go at the same rate. Otherwise 
their clock rates are different, and the apparent 
course of events differs, too. 

Near the event horizon the different time def¬ 
initions become significant. An observer falling 
into a black hole reaches the centre in a finite 
time, according to his own clock, and does not 
notice anything special as he passes through the 
event horizon. However, to a distant observer he 
never seems to reach the event horizon; his ve- 
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locity of fall seems to decrease towards zero as 
he approaches the horizon. 

The slowing down of time also appears as 
a decrease in the frequency of light signals. The 
formula for the gravitational redshift can be writ¬ 
ten in terms of the Schwarzschild radius as (Ap¬ 
pendix B) 

/ 2 GM I R^ 

^ = = (15 - 2) 

Here, v is the frequency of radiation emitted at 
a distance r from the black hole and the fre¬ 
quency observed by an infinitely distant observer. 
It can be seen that the frequency at infinity ap¬ 
proaches zero for radiation emitted near the event 
horizon. 

Since the gravitational force is directed to¬ 
wards the centre of the hole and depends on the 
distance, different parts of a falling body feel 
a gravitational pull that is different in magnitude 
and direction. The tidal forces become extremely 
large near a black hole so that any material falling 
into the hole will be torn apart. All atoms and el¬ 
ementary particles are destroyed near the central 
point, and the final state of matter is unknown to 
present-day physics. The observable properties of 
a black hole do not depend on how it was made. 

Not only all information on the material com¬ 
position disappears as a star collapses into a black 
hole; any magnetic field, for example, also disap¬ 
pears behind the event horizon. A black hole can 
only have three observable properties: mass, an¬ 
gular momentum and electric charge. 

It is improbable that a black hole could have 
a significant net charge. An electrically charged 
black hole would attract particles with opposite 
charge until it became neutral. Rotation, on the 
other hand, is typical to stars, and thus black 
holes, too, must rotate. Since the angular momen¬ 
tum is conserved, stars collapsed to black holes 
must rotate very fast. 

In 1963 Roy Kerr managed to find a solution 
of the field equations for a rotating black hole. In 
addition to the event horizon a rotating hole has 
another limiting surface, an ellipsoidal static limit 
(Fig. 15.8). Objects inside the static limit cannot 
be kept stationary by any force, but they must 
orbit the hole. However, it is possible to escape 


from the region between the static limit and the 
event horizon, called the ergosphere. In fact it is 
possible to utilise the rotational energy of a black 
hole by dropping an object to the ergosphere in 
such a way that part of the object falls into the 
hole and another part is slung out. The outcom- 
ing part may then have considerably more kinetic 
energy than the original object. 

At present the only known way in which 
a black hole could be directly observed is by 
means of the radiation from gas falling into it. 
For example, if a black hole is part of a binary 
system, gas streaming from the companion will 
settle into a disk around the hole. Matter at the 
inner edge of the disk will fall into the hole. The 
accreting gas will lose a considerable part of its 
energy (up to 40 % of the rest mass) as radiation, 
which should be observable in the X-ray region. 

Some rapidly and irregularly varying X-ray 
sources of the right kind have been discovered. 
The first strong evidence for black hole in an 
X-ray binary was for CygnusX-1 (Fig. 15.9). Its 
luminosity varies on the time scale of 0.001 s, 
which means that the emitting region must be 
only 0.001 light-seconds or a few hundred kilo¬ 
metres in size. Only neutron stars and black holes 
are small and dense enough to give rise to such 
high-energy processes. CygnusX-1 is the smaller 
component of the double system HDE 226868. 
The larger component is an optically visible su¬ 
pergiant with a mass 20-25 Mq. The mass of 
the unseen component has been calculated to be 
10-15 Mq. If this is correct, the mass of the sec¬ 
ondary component is much larger than the up¬ 
per limit for a neutron star, and thus it has to be 
a black hole. 

Today 20 such systems are known, where the 
compact component has a mass larger than 3 Mq , 
and therefore is probably a black hole. As shown 
in Fig. 15.10 these can be of very different sizes. 
Nearly all of them have been discovered as X-ray 
novae. 

Many frightening stories about black holes 
have been invented. It should therefore be stressed 
that they obey the same dynamical laws as other 
stars—they are not lurking in the darkness of 
space to attack innocent passers-by. If the Sun 
became a black hole, the planets would continue 
in their orbits as if nothing had happened. 
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Fig. 15.9 The arrow shows the variable star V1357 Cyg. 
Its companion is the X-ray source CygnusX-1, suspected 
to be a black hole. Cyg X-l itself is so faint that it can be 
observed by its X-ray radiation only. The bright star to the 
lower right of V1357 is 11 Cygni, one of the brightest stars 
in the constellation Cygnus 

So far we have discussed only black holes with 
masses in the range of stellar masses. There is 
however no upper limit to the mass of a black 
hole. Many active phenomena in the nuclei of 
galaxies can be explained with supermassive 
black holes with masses of millions or thousands 
of millions solar masses (see Sect. 19.7). 


15.4 X-ray Binaries 

Close binaries where a neutron star or a black 
hole is accreting matter from its companion, usu¬ 
ally a main sequence star, will be visible as 
strong X-ray sources. They are generally classi¬ 
fied as high-mass X-ray binaries (HMXB), when 
the companion has a mass larger than about 
10 Mq, and low-mass X-ray binaries (LMXB) 
with a companion mass smaller than 1.2 Mq. 
In HMXBs the source of the accreted material 


is a strong stellar wind. LMXBs are produced 
by Roche-lobe overflow of the companion star, 
either because the major axis of the binary de¬ 
creases due to angular momentum loss from the 
system, or else because the radius of the compan¬ 
ion is increasing as it evolves. 

Because of the rapid evolution of the massive 
component in HMXBs these systems are young 
and short-lived, 10 5 -10 7 a. In LMXBs the life¬ 
time is determined by the mass-transfer process, 
and may be longer, 10 7 —10 10 a. In many respects 
they are similar to cataclysmic variables, and may 
give rise to analogous phenomena. 

Many kinds of variable X-ray sources have 
been discovered since they were first observed in 
the 1970’s. Among these, the X-ray pulsars and 
the X-ray bursters can only be neutron stars. In 
other types of X-ray binaries it can be difficult to 
determine whether the primary is a neutron star 
or a black hole. 

Neutron stars and black holes are formed in 
supernova explosions, and in a binary system the 
explosion would normally be expected to disrupt 
the binary. An X-ray binary will only form under 
special conditions. Some examples are shown in 
Sect. 12.6. 

X-ray Pulsars X-ray pulsars always belong to 
binary systems and may be either HMXBs or 
LMXBs. The pulse periods of X-ray pulsars in 
high-mass systems are significantly longer than 
those of radio pulsars, from a few seconds to tens 
of minutes. In contrast to radio pulsars, the period 
of the pulsed emission of these pulsars decreases 
with time. 

The characteristic properties of X-ray pul¬ 
sars can be understood from their binary nature. 
A neutron star formed in a binary system is first 
seen as a normal radio pulsar. Initially, the strong 
radiation of the pulsar prevents gas from falling 
onto it. However, as it slows down, its energy de¬ 
creases, and eventually the stellar wind from the 
companion can reach its surface. The incoming 
gas is channelled to the magnetic polar caps of 
the neutron star, where it emits strong X-ray ra¬ 
diation as it hits the surface. This produces the 
observed pulsed emission. 

In low-mass systems, the angular momentum 
of the incoming gas speeds up the rotation of 
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Fig. 15.10 Scale drawings 
of 16 black-hole binaries in 
the Milky Way (courtesy of 
J. Orosz). The 
Sun-Mercury distance 
(0.4 AU) is shown at the 
top. The estimated binary 
inclination is indicated by 
the tilt of the accretion 
disk. (R.A. Remillard, 

J.E. McClintock 2006, 
ARAA 44, 54) 



the pulsar. The maximum possible rotation rate 
of a neutron star before centrifugal forces start to 
break it up corresponds to a period of about a mil¬ 
lisecond. A few millisecond pulsars with periods 
of this order are known, both in the radio and in 
the X-ray region. It is thought that these are (or, 
in the radio case, have once been) members of bi¬ 
nary systems. 

The emission curve of a typical fast X-ray pul¬ 
sar, HerculesXl, is shown in Fig. 15.11. The pe¬ 
riod of the pulses is 1.24 s. This neutron star is 
part of an eclipsing binary system, known from 
optical observations as HZHerculis. The orbital 
properties of the system can therefore be deter¬ 
mined. Thus e.g. the mass of the pulsar is about 
one solar mass, reasonable for a neutron star. 

X-ray Bursters X-ray bursters are irregular 
variables, showing sudden brightenings, known 
as type I X-ray bursts, at random times 
(Fig. 15.12). The typical interval between out¬ 
bursts is a few hours or days, but more rapid 


bursters are also known. The strength of the out¬ 
burst seems to be related to the recharging time. 

Type I X-ray bursts are analogous to the erup¬ 
tions of classical novae. However, the source of 
radiation in X-ray bursters cannot be the igni¬ 
tion of hydrogen, since the maximum emission 
is in the X-ray region. Instead, gas from the com¬ 
panion settles on the surface of the neutron star, 
where hydrogen burns steadily to helium. Then, 
when the growing shell of helium reaches a criti¬ 
cal temperature, it burns to carbon in a rapid he¬ 
lium flash. Since, in this case, there are no thick 
damping outer layers, the flash appears as a burst 
of X-ray radiation. 

X-ray Novae The X-ray pulsars and bursters 
have to be neutron stars. Other X-ray binaries 
may be either neutron stars or black holes. All 
compact X-ray sources are variable to some ex¬ 
tent. In the persistent sources the variations are 
moderate, an the sources always visible. The ma¬ 
jority of sources are transient. 
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Fig. 15.11 The pulses of 
the X-ray pulsar 
Hercules XI have the 
period 1.24 s. The 
best-fitting curve has been 
superimposed on the 
observations. (Tananbaum, 
H. et al. (1972): Astrophys. 
J. (Lett.) 174, L143) 



Fig. 15.12 The variations 
of the rapid X-ray burster 
MXB 1730-335. An 
100 second interval is 
marked in the diagram. 
(Lewin, W.H.G. (1977): 
Ann. N.Y. Acad. Sci. 302, 
310) 
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If the X-ray bursters correspond to classical 
novae, the counterparts of dwarf novae are the 
X-ray novae, also known as soft X-ray transients 
(SXT). Quantitatively there are large differences 
between these types of systems. Dwarf novae 
have outbursts lasting for a few days at intervals 
of a few months, for SXTs the outbursts hap¬ 
pen at decade-long intervals and last for months. 
A dwarf nova brightens by a factor about 100 dur¬ 
ing outbursts, a SXT by a factor of 10 6 . The light- 
curves of neutron-star and black-hole SXTs are 
compared in Fig. 15.13. 


The SXTs are alternating between (at least) 
two states: During the high state thermal radia¬ 
tion from the accretion disk dominates, whereas 
in the low state the X-ray have a higher energy, 
and are produced by Compton scattering by hot 
electrons in a disk corona or a jet. 

Microquasars One interesting aspect of X-ray 
binaries is their connection to models of active 
galactic nuclei (AGN, Sect. 19.7). In both sys¬ 
tems a black hole, which in the case of AGN may 
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have a mass in the range 10 6 -10 10 Mq, is sur¬ 
rounded by an accretion disk. 

In an X-ray binary there is similarly an accre¬ 
tion disk surrounding a compact object, a stellar- 
mass black hole. It will exhibit phenomena in 
many respects similar to those in AGN. Since 
the galactic sources are much nearer, and vary on 
much shorter time-scales, they may allow more 
detailed observations of these phenomena. 



Fig. 15.13 Light-curves of a neutron-star (Aql X- 1 ) and a 
black-hole (GRO J1655—40) transient source, as observed 
by the All Sky Monitor on RXTE. (D. Psaltis 2006, in 
Compact Stellar X-ray Sources, ed. Lewin, vdKlis, CUP, 
p. 16, Fig. 1.9) 


For example, relativistic jets perpendicular to 
the disk are common in AGN, and they can also 
be expected in X-ray binaries. A few examples 
of such microquasars have been discovered, see 
Fig. 15.14. 

Furthermore, in AGN the jet may sometimes 
be pointing straight at us. Relativistic effects will 
then lead to a brightening of the source. In a mi¬ 
croquasar there might be a similar effect, which 
would provide one explanation for the ultralu- 
minous X-ray sources (ULX), sources which ap¬ 
pear to be too luminous to be produced by or¬ 
dinary stellar-mass black holes. This is impor¬ 
tant, because according to an alternative model 
ULXs contain an intermediate mass black hole 
with a mass about 10 3 Mq. The origin of such in¬ 
termediate mass black holes, if they exist, is an 
intriguing problem. 


15.5 Examples 

Example 15.1 Assume that the Sun collapses 
into a neutron star with a radius of 20 km. 
(a) What will be the mean density of the neutron 
star? (b) What would be it rotation period? 


Fig. 15.14 Observed 
outburst of the microquasar 
GRS 1915+105 on 
9 September, 1997. The 
disappearance of the 
internal part of the 
accretion disc (decrease in 
the X-ray flux) is followed 
by an ejection of 
relativistic plasma clouds 
(oscillation in the infrared 
and radio). (S. Chaty, 
astro-ph/0607668) 
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(a) The mean density is 

M q 2 x 10 30 kg 

^ tR 3 jjr(20 x 10 3 ) 3 m 3 

«6x 10 16 kg/m 3 . 


For the Sun we have 


Rs = 


2x6.67x 10 -11 m 3 s _2 kg _l x 1.989x 10 30 kg 
(2.998xlO 8 ms" 1 ) 2 


= 2950 m. 


One cubic millimetre of this substance would 
weigh 60 million kilos. 

(b) To obtain an exact value, we should take 
into account the mass distributions of the Sun and 
the resulting neutron star. Very rough estimates 
can be found assuming that both are homoge¬ 
neous. Then the moment of inertia is I = ^ MR 2 , 
and the angular momentum is L = I o>. The rota¬ 
tion period is then obtained as in Example 12.1: 


P = 


Pq 



2 


= 25 d 


20 x 10 3 m \ 2 
6.96 x 10 8 m ) 


= 2.064 x 10“ 8 


d 


^ 0.0018 s. 


The Sun would make over 550 revolutions per 
second. 


Example 15.2 What should be the radius of the 
Sun if the escape velocity from the surface were 
to exceed the speed of light? 

The escape velocity exceeds the speed of light 
if 


or 


2 GM 


R 


> c 


R < 


2 GM 


= Rs- 


15.6 Exercises 

Exercise 15.1 The mass of a pulsar is 1.5 Mq, 
radius 10 km, and rotation period 0.033 s. What 
is the angular momentum of the pulsar? Varia¬ 
tions of 0.0003 s are observed in the period. If 
they are due to radial oscillations (“starquakes”), 
how large are these oscillations? 

Exercise 15.2 In Dragon’s Egg by Robert L. 
Forward a spaceship orbits a neutron star at a dis¬ 
tance of 406 km from the centre of the star. The 
orbital period is the same as the rotation period of 
the star, 0.1993 s. 

(a) Find the mass of the star and the gravitational 
acceleration felt by the spaceship. 

(b) What is the effect of the gravitation on 
a 175 cm tall astronaut, if (s)he stands with 
her/his feet pointing towards the star? And if 
(s)he is lying tangential to the orbit? 

Exercise 15.3 A photon leaves the surface of 
a star at a frequency i> e . An infinitely distant ob¬ 
server finds that its frequency is v. If the differ¬ 
ence is due to gravitation only, the change in the 
energy of the photon, h An, equals the change in 
its potential energy. Find the relation between v 
and v e , assuming the mass and radius of the star 
are M and R. How much will the solar radiation 
redshift on its way to the Earth? 
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Although most of the mass of the Milky Way 
Galaxy is condensed into stars, interstellar space 
is not completely empty. It contains gas and dust 
in the form both of individual clouds and of a dif¬ 
fuse medium. Interstellar space typically contains 
about one gas atom per cubic centimetre and 
100 dust particles per cubic kilometre. 

Altogether, about 10 % of the mass of the 
Milky Way consists of interstellar gas. Since the 
gas is strongly concentrated in the galactic plane 
and the spiral arms, in these regions there are 
many places where the quantities of stars and in¬ 
terstellar matter are about equal. The dust (a bet¬ 
ter name would be “smoke”, since the particle 
sizes are much smaller than in terrestrial dust) 
constitutes about one percent of the gas. High- 
energy cosmic ray particles are mixed with the 
gas and dust. There is also a weak, but still very 
important, galactic magnetic field. 

At present the most important observations of 
the interstellar medium are made at radio and in¬ 
frared wavelengths, since the peak of the emis¬ 
sion often lies at these wavelengths. But many 
forms of interstellar matter (such as solid bod¬ 
ies with diameters larger than 1 mm) would be 
almost impossible to detect on the basis of their 
emission or absorption. In principle, the mass of 
these forms of matter might be larger than the ob¬ 
served mass of all other forms put together. How¬ 
ever, an upper limit on the total mass of interstel¬ 
lar matter, regardless of its form, can be derived 
on the basis of its gravitational effects. This is 
the Oort limit. The galactic gravitational field is 
determined by the distribution of matter. By ob¬ 


serving the motions of stars perpendicular to the 
galactic plane, the vertical gravitational force and 
hence the amount of mass in the galactic plane 
can be determined. The result is that the local 
density within 1 kpc of the Sun is (7.3-10.0) x 
10 -21 kgm -3 . The density of known stars is 
(5.9-6.7) x 10 -21 kg m~ 2 and that of known in¬ 
terstellar matter about 1.7 x 10 -21 kgm -3 . Thus 
there is very little room for unknown forms of 
mass in the solar neighbourhood. However, the 
limit concerns only the dark matter concentrated 
in the galactic plane. There are indications that 
the Milky Way is surrounded by a spherical halo 
of dark matter (Chap. 18). 


16.1 Interstellar Dust 

The first clear evidence for the existence of in¬ 
terstellar dust was obtained around 1930. Before 
that, it had been generally thought that space is 
completely transparent and that light can propa¬ 
gate indefinitely without extinction. 

In 1930 Robert Trumpler published his study 
of the space distribution of the open clusters. 
The absolute magnitudes M of the brightest stars 
could be estimated on the basis of the spectral 
type. Thus the distance r to the clusters could 
be calculated from the observed apparent magni¬ 
tudes m of the bright stars: 

m-M = 51g —^—. (16.1) 

10 pc 

Trumpler also studied the diameters of the clus¬ 
ters. The linear diameter D is obtained from the 
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apparent angular diameter d by means of the for¬ 
mula 


D — dr , (16.2) 

where r is the distance of the cluster. 

It caught Trumpler’s attention that the more 
distant clusters appeared to be systematically 
larger than the nearer ones (Fig. 16.1). Since this 
could hardly be true, the distances of the more 
distant clusters must have been overestimated. 
Trumpler concluded that space is not completely 
transparent, but that the light of a star is dimmed 
by some intervening material. To take this into 
account, (16.1) has to be replaced with (4.17) 

m-M — 5 lg — 7 --hA, (16.3) 

10 pc 

where A > 0 is the extinction in magnitudes due 
to the intervening medium. If the opacity of the 
medium is assumed to be the same at all distances 
and in all directions, A can be written 

A = ar, (16.4) 

where a is a constant. Trumpler obtained for the 
average value of a in the galactic plane, <7 pg = 
0.79 mag/kpc, in photographic magnitudes. At 
present, a value of 2 mag/kpc is used for the av¬ 
erage extinction. Thus the extinction over a 5 kpc 
path is already 10 magnitudes. 

Extinction due to dust varies strongly with 
direction. For example, visible light from the 



1 2 3 

Distance [kpc] 


Fig. 16.1 The diameters of open star clusters calculated 
with the distance given by the formula (15.1) according 
to Trumpler (1930). The increase of the diameter with dis¬ 
tance is not a real phenomenon, but an effect of interstellar 
extinction, which was discovered in this way 


galactic centre (distance 8-9 kpc) is dimmed by 
30 magnitudes. Therefore the galactic centre can¬ 
not be observed at optical wavelengths. 

Extinction is due to dust grains that have diam¬ 
eters near the wavelength of the light. Such par¬ 
ticles scatter light extremely efficiently. Gas can 
also cause extinction by scattering, but its scatter¬ 
ing efficiency per unit mass is much smaller. The 
total amount of gas allowed by the Oort limit is so 
small that scattering by gas is negligible in inter¬ 
stellar space. (This is in contrast with the Earth’s 
atmosphere, where air molecules make a signifi¬ 
cant contribution to the total extinction). 

Interstellar particles can cause extinction in 
two ways: 

1. In absorption the radiant energy is trans¬ 
formed into heat, which is then re-radiated 
at infrared wavelengths corresponding to the 
temperature of the dust particles. 

2. In scattering the direction of light propagation 
is changed, leading to a reduced intensity in 
the original direction of propagation. 

An expression for interstellar extinction will 
now be derived. The size, index of refraction and 
number density of the particles are assumed to 
be known. For simplicity we shall assume that all 
particles are spheres with the same radius a and 
the geometrical cross section jt a 2 . The true ex¬ 
tinction cross section of the particles C ex t will be 

Cext = Gexfra 2 , (16.5) 


where Q ext is the extinction efficiency factor. 

Let us consider a volume element with length 
d/ and cross section dA, normal to the direction 
of propagation (Fig. 16.2). It is assumed that the 
particles inside the element do not shadow each 
other. If the particle density is n, there are n d/ dA 
particles in the volume element and they will 
cover the fraction dr of the area dA, where 


dr = 


ndAdl Cext 
dA 


— It Cext d/. 


In the length dl the intensity is thus changed by 


dl — —I dr. (16.6) 


On the basis of (16.6) dr can be identified as the 
optical depth. 
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d / 



Fig. 16.2 Extinction by a distribution of particles. In the 
volume element with length dZ and cross section dA, there 
are n dA d/ particles, where n is the particle density in the 
medium. If the extinction cross section of one particle is 

The total optical depth between the star and 
the Earth is 

r(r)= / dr = / nC ext dl = C ext nr, 

Jo Jo 

where n is the average particle density along the 
given path. According to (4.18) the extinction in 
magnitudes is 

A = (2.5 lge)r, 

and hence 

A(r) — (2.51ge)C ext nr. (16.7) 

This formula can also be inverted to calculate n, 
if the other quantities are known. 

The extinction efficiency factor Q ext can be 
calculated exactly for spherical particles with 
given radius a and refractive index m. In general, 

Qext 1 — Q ab> + Q sca> 

where 

<2 abs = absorption efficiency factor, 

Q sca = scattering efficiency factor. 

If we define 

x = 2ita/X, (16.8) 

where X is the wavelength of the radiation, then 

(16.9) 


C ex t. the total area covered by the particles is n dA dZ C ex t- 
Thus the fractional decrease in intensity over the distance 
d/ is d I/I = —n dAd/ C ex t/dA = —n C ex tdZ 



Fig. 16.3 Mie scattering: the extinction efficiency factor 
for spherical particles for the refractive indices m = 1.5 
and m = 1.33 (refractive index of water). The horizon¬ 
tal axis is related to the size of the particle according to 
x = 2na/X, where a is the particle radius and X, the wave¬ 
length of the radiation 

The exact expression for Q ext is a series ex¬ 
pansion in x that converges more slowly for 
larger values of x. When r « 1, the process is 
called Rayleigh scattering-, otherwise it is known 
as Mie scattering. Figure 16.3 shows Qext as 
a function of x for m = 1.5 and m = 1.33. For 
very large particles, (x 5 s> 1 ) Q ext = 2 , as appears 
from Fig. 16.3. Purely geometrically one would 
have expected Q C xt = 1 ; the two times larger scat¬ 
tering efficiency is due to the diffraction of light 
at the edges of the particle. 

Other observable phenomena, apart from ex¬ 
tinction, are also caused by interstellar dust. One 
of these is the reddening of the light of stars. 
(This should not be confused with the redshift 
of spectral lines.) Reddening is due to the fact 


Qext — Qc\t( V. 777 ). 
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Fig. 16.4 (a) Schematic representation of the interstel¬ 
lar extinction. As the wavelength increases, the extinc¬ 
tion approaches zero. (Drawing based on Greenberg, J.M. 
(1968): “Interstellar Grains”, in Nebulae and Interstel¬ 
lar Matter, ed. by Middlehurst, B.M., Aller, L.H., Stars 


and Stellar Systems, Vol. VII (The University of Chicago 
Press, Chicago) p. 224). (b) Measured extinction curve, 
normalised to make £b-v = 1. (Hoyle, F., Narlikar, J. 
(1980): The Physics-Astronomy Frontier (W.H. Freeman 
and Company, San Francisco) p. 156. Used by permission) 


that the amount of extinction becomes larger for 
shorter wavelengths. Going from red to ultravi¬ 
olet, the extinction is roughly inversely propor¬ 
tional to wavelength. For this reason the light of 
distant stars is redder than would be expected on 
the basis of their spectral class. The spectral class 
is defined on the basis of the relative strengths of 
the spectral lines which are not affected by ex¬ 
tinction. 

According to (4.20), the observed colour index 
B — V of a star is 


B — V = Mg — My + Ab — Ab 

= (B-V)o + E B - Y , (16.10) 


where (B — V)o is the intrinsic colour of the star 
and F’b-v the colour excess. As noted in Sect. 4.5 
the ratio between the visual extinction Ay and the 
colour excess is approximately constant: 


Ay Ay 

Eb-v A b — Ay 


3.0. 


(16.11) 


R does not depend on the properties of the star 
or the amount of extinction. This is particularly 
important in photometric distance determinations 
because of the fact that the colour excess £b-v 
can be directly determined from the difference 
between the observed colour index B — V and the 
intrinsic colour (B — F)o known from the spec¬ 


tral class. One can then calculate the extinction 

Ay ss 3.0 £ B -v (16.12) 

and finally the distance. Since the interstellar 
medium is far from homogeneous, the colour ex¬ 
cess method gives a much more reliable value 
than using some average value for the extinction 
in (4.18). 

The wavelength dependence of the extinction, 
A (a), can be studied by comparing the magni¬ 
tudes of stars of the same spectral class in differ¬ 
ent colours. These measurements have shown that 
A (a) approaches zero as A becomes very large. 
In practice A (A) can be measured up to a wave¬ 
length of about two micrometres. The extrapola¬ 
tion to zero inverse wavelength is then fairly re¬ 
liable. Figure 16.4(a) shows A(A) as a function 
of inverse wavelength. It also illustrates how the 
quantities Ay and £b-v, which are needed in or¬ 
der to calculate the value of R, are obtained from 
this extinction or reddening curve. Figure 16.4(b) 
shows the observed extinction curve. The points 
in the ultraviolet (A < 0.3 m) are based on rocket 
measurements. 

It is clear from Fig. 16.4(b) that interstellar ex¬ 
tinction is largest at short wavelengths in the ul¬ 
traviolet and decreases for longer wavelengths. In 
the infrared it is only about ten percent of the op¬ 
tical extinction and in the radio region it is vanish¬ 
ingly small. Objects that are invisible in the opti- 
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cal region can therefore be studied at infrared and 
radio wavelengths. 

Another observed phenomenon caused by dust 
is the polarisation of the light of the stars. Since 
spherical particles cannot produce any polarisa¬ 
tion, the interstellar dust particles have to be non- 
spherical in shape. If the particles in a cloud are 
aligned by the interstellar magnetic field, they 
will polarise the radiation passing through the 
cloud. The degree of polarisation and its wave¬ 
length dependence give information on the prop¬ 
erties of the dust particles. By studying the di¬ 
rection of polarisation in various directions, one 
can map the structure of the galactic magnetic 
field. 

In the Milky Way interstellar dust is essen¬ 
tially confined to a very thin, about 100 pc, layer 
in the galactic plane. The dust in other spiral 
galaxies has a similar distribution and is directly 
visible as a dark band in the disk of the galaxy 
(Fig. 19.17 bottom). The Sun is located near 
the central plane of the galactic dust layer, and 
thus the extinction in the direction of the galac¬ 
tic plane is very large, whereas the total extinc¬ 
tion towards the galactic poles may be less than 
0.1 magnitudes. This is apparent in the distribu¬ 
tion of galaxies in the sky: at high galactic lat¬ 
itudes, there are many galaxies, while near the 
galactic plane, there is a 20° zone where hardly 
any galaxies are seen. This empty region is called 
the zone of avoidance. 

If a homogeneous dust layer gives rise to a to¬ 
tal extinction of Am magnitudes in the vertical 
direction, then according to Fig. 16.5, the total 
extinction at galactic latitude b will be 

Am(b) — Am/ sini>. (16.13) 

If the galaxies are uniformly distributed in space, 
then in the absence of extinction, the number of 
galaxies per square degree brighter than the mag¬ 
nitude m would be 

lgNo(m) = 0.6m + C, (16.14) 

where C is a constant (see Exercise 17.1). How¬ 
ever, due to extinction, a galaxy that would oth¬ 
erwise have the apparent magnitude /«o will have 



Fig. 16.5 In a homogeneous medium the extinction in 
magnitudes is proportional to the pathlength traversed. 
If the extinction in the direction of the galactic pole is 
Am, then the extinction at the galactic latitude b will be 
Am/ sin I? 

the magnitude 

m ( b ) = ;«o + Am ( b) — mo + Am/ sin b, 

(16.15) 

where b is the galactic latitude. Thus the observ¬ 
able number of galaxies at latitude b will be 

lg N(m, b) — lg No(m — Am{b)) 

= 0.6 (m — A m(bf) + C 
= lg No(m) — 0.6A m(b) 

or 

\gN(m,b) = C' -0.6—, (16.16) 

sin b 

where C' = lgNo(m) does not depend on the 
galactic latitude. By making galaxy counts at var¬ 
ious latitudes b, the extinction Am can be de¬ 
termined. The value obtained from galaxy counts 
made at Lick Observatory is Am pg = 0.51 mag. 

The total vertical extinction of the Milky Way 
has also been determined from the colour ex¬ 
cesses of stars. These investigations have yielded 
much smaller extinction values, about 0.1 mag. In 
the direction of the north pole, extinction is only 
0.03 mag. The disagreement between the two ex¬ 
tinction values is probably largely due to the fact 
that the dust layer is not really homogeneous. If 
the Sun is located in a local region of low dust 
content, the view towards the galactic poles might 
be almost unobstructed by dust. 
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Fig. 16.6 The Coalsack is a dark nebula next to the Southern Cross. (Photograph K. Mattila, Helsinki University) 


Dark Nebulae Observations of other galaxies 
show that the dust is concentrated in the spi¬ 
ral arms, in particular at their inner edge. In ad¬ 
dition dust is concentrated in individual clouds, 
which appear as star-poor regions or dark nebulae 
against the background of the Milky Way. Exam¬ 
ples of dark nebulae are the Coalsack in the south¬ 
ern sky (Fig. 16.6) and the Horsehead nebula 
in Orion. Sometimes the dark nebulae form ex¬ 
tended winding bands, and sometimes small, al¬ 
most spherical, objects. Objects of the latter type 
are most easy to see against a bright background, 
e.g. a gas nebula (see Fig. 16.19). These objects 
have been named globules by Bart J. Bok, who 


put forward the hypothesis that they are clouds 
that are just beginning to contract into stars. 

The extinction by a dark nebula can be illus¬ 
trated and studied by means of a Wolf diagram, 
shown schematically in Fig. 16.7. The diagram is 
constructed on the basis of star counts. The num¬ 
ber of stars per square degree in some magnitude 
interval (e.g. between magnitudes 14 and 15) in 
the cloud is counted and compared with the num¬ 
ber outside the nebula. In the comparison area, 
the number of stars increases monotonically to¬ 
wards fainter magnitudes. In the dark nebula the 
numbers first increase in the same way, but be¬ 
yond some limiting magnitude (10 in the figure) 
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Fig. 16.7 Wolf diagram. The horizontal coordinate is the 
magnitude and the vertical coordinate is the number of 
stars per square degree in the sky brighter than that mag¬ 
nitude. A dark nebula diminishes the brightness of stars 
lying behind it by the amount Am 

the number of stars falls below that outside the 
cloud. The reason for this is that the fainter stars 
are predominantly behind the nebula, and their 
brightness is reduced by some constant amount 
Am (2 magnitudes in the figure). The brighter 
stars are mostly in front of the nebula and suffer 
no extinction. 

Reflection Nebulae If a dust cloud is near 
a bright star, it will scatter, i.e. reflect the light 
of the star. Thus individual dust clouds can some¬ 
times be observed as bright reflection nebulae. 
Some 500 reflection nebulae are known. 

The regions in the sky richest in reflection 
nebulae are the areas around the Pleiades and 
around the giant star Antares. Antares itself is 
surrounded by a large red reflection nebula. This 
region is shown in Fig. 16.8. Figure 16.9 shows 
the reflection nebula NGC 2068, which is located 
near a large, thick dust cloud a few degrees north¬ 
west of Orion’s belt. It is one of the brightest 
reflection nebulae and the only one included in 
the Messier catalogue (M78). In the middle of 
the nebula there are two stars of about 11 mag¬ 
nitudes. The northern star illuminates the nebula, 
while the other one is probably in front of the 
nebula. Figure 16.10 shows the reflection nebula 


NGC 1435 around Merope in the Pleiades. An¬ 
other bright and much-studied reflection nebula is 
NGC 7023 in Cepheus. It, too, is connected with 
a dark nebula. The illuminating star has emission 
lines in its spectrum (spectral type Be). Infrared 
stars have also been discovered in the area of the 
nebula, probably a region of star formation. 

In 1922 Edwin Hubble published a fundamen¬ 
tal investigation of bright nebulae in the Milky 
Way. On the basis of extensive photographic and 
spectroscopic observations, he was able to estab¬ 
lish two interesting relationships. First he found 
that emission nebulae only occur near stars with 
spectral class earlier than B0, whereas reflec¬ 
tion nebulae may be found near stars of spectral 
class B1 and later. Secondly Hubble discovered 
a relationship between the angular size R of the 
nebula and the apparent magnitude m of the illu¬ 
minating star: 

5 lg R — — m + const. (16.17) 

Thus the angular diameter of a reflection neb¬ 
ula is larger for a brighter illuminating star. Since 
the measured size of a nebula generally increases 
for longer exposures, i.e. fainter limiting surface 
brightness, the value of R should be defined to 
correspond to a fixed limiting surface brightness. 
The value of the constant in the Hubble relation 
depends on this limiting surface brightness. The 
Hubble relation for reflection nebulae is shown in 
Fig. 16.11, based on measurements by Sidney van 
den Bergh from Palomar Sky Atlas plates. Each 
point corresponds to a reflection nebula and the 
straight line represents the relation (16.17), where 
the value of the constant is 12.0 (R is given in arc 
minutes). 

The Hubble relation can be derived theo¬ 
retically, if it is assumed that the illumination 
of a dust cloud is inversely proportional to the 
square of the distance to the illuminating star, and 
that the dust clouds are uniformly distributed in 
space. The theoretical Hubble relation also gives 
an expression for the constant on the right-hand 
side, which involves the albedo and the phase 
function of the grains. 

The observations of reflection nebulae show 
that the albedo of interstellar grains must be quite 
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Fig. 16.8 Bright and dark nebulae in Scorpius and Ophi- 
uchus. Photograph (a) was taken in the blue colour re¬ 
gion, X = 350-500 nm, and (b) in the red colour region, 
X = 600-680 nm. (The sharp rings in (b) are reflections 
of Antares in the correction lens of the Schmidt camera.) 
The nebulae located in the area are identified in draw¬ 
ing (c). B44 and H4 are dark nebulae. There is a large 
reflection nebula around Antares, which is faintly visible 
in the blue (a), but bright in the red (b) regions. Antares 
is very red (spectral class Ml) and therefore the reflec- 

high. It has not yet been possible to obtain its pre¬ 
cise numerical value in this way, since the dis¬ 
tances between the nebulae and their illuminating 
stars are not known well enough. 

One may also consider the surface brightness 
of dark nebulae that are not close enough to a star 
to be visible as reflection nebulae. These nebulae 
will still reflect the diffuse galactic light from all 
the stars in the Milky Way. Calculations show that 
if the dust grains have a large albedo, then the 


tion nebula is also red. In contrast, the reflection nebu¬ 
lae around the blue stars p Ophiuchi (B2), CD-24° 12684 
(B3), 22 Scorpii (B2) and cr Scorpii (Bl) are blue and are 
visible only in (a). In (b) there is an elongated nebula to 
the right of a Scorpii, which is invisible in (a). This is an 
emission nebula, which is very bright in the red hydro¬ 
gen H„ line (656 nm). In this way reflection and emission 
nebulae can be distinguished by means of pictures taken in 
different wavelength regions. (Photograph (a) E. Barnard, 
and (b) K. Mattila) 

reflected diffuse light should be bright enough to 
be observable, and it has indeed been observed. 
Thus the dark nebulae are not totally dark. The 
diffuse galactic light constitutes about 20-30 % 
of the total brightness of the Milky Way. 

Dust Temperature In addition to scattering 
the interstellar grains also absorb radiation. The 
absorbed energy is re-radiated by the grains 
at infrared wavelengths corresponding to their 
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Fig. 16.9 The reflection 
nebula NGC 2068 (M78) in 
Orion. In the middle of the 
nebula there are two stars 
of about magnitude 11. The 
northern one (at the top) is 
the illuminating star, while 
the other one probably lies 
in the foreground. 
(Photography Lunar and 
Planetary Laboratory, 
Catalina Observatory) 





Fig. 16.10 The reflection 
nebula NGC 1435 around 
Merope (23 Tau, spectral 
class B6) in the Pleiades. 
This figure should be 
compared with Fig. 16.1, 
where Merope is visible as 
the lowest of the bright 
stars in the Pleiades. 
(National Optical 
Astronomy Observatories, 
Kitt Peak National 
Observatory) 
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Nebular radius R [Arcminutes] 


Fig. 16.11 The Hubble relation for reflection nebulae. 
The horizontal axis shows the radius R of the nebulae in 
arc minutes and the vertical axis, the (blue) apparent mag¬ 
nitude m of the central star. No measurements were made 
in the hatched region, (van den Bergh, S. (1966): Astron. 
J. 71, 990) 


temperatures. The temperature of dust in inter¬ 
stellar space (including dark nebulae) is about 
10-20 K. The corresponding wavelength ac¬ 
cording to Wien’s displacement law (5.21) is 
300-150 pm. Near a hot star the temperature of 
the dust may be 100-600 K and the maximum 
emission is then at 30-5 pm. In HII regions the 
dust temperature is about 70-100 K. 

The rapid development of infrared astronomy 
in the 1970’s has brought all the above-mentioned 
dust sources within the reach of observations 
(Fig. 16.12). In addition infrared radiation from 
the nuclei of normal and active galaxies is largely 
thermal radiation from dust. Thermal dust emis¬ 
sion is one of the most important sources of in¬ 
frared radiation in astronomy. 



Fig. 16.12 Interstellar dust is best seen in infrared wave¬ 
lengths. Two examples of the images by the IRAS satel¬ 
lite. (a) In a view towards the Galactic centre, the dust is 
seen to be concentrated in a narrow layer in the galactic 
plane. Several separate clouds are also seen, (b) Most of 


the constellation Orion is covered by a complex area of in¬ 
terstellar matter. The densest concentrations of dust below 
the centre of the image, are in the region of the Horsehead 
nebula and the Orion nebula. (Photos NASA) 
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Wavelength A 

Fig. 16.13 More than 99 % of the radiation from the 
tj Carinae nebula (Fig. 13.10) is in the infrared. The 
peak in the visual region is from the hydrogen H a line 
(0.66 pm). In the infrared, the silicate emission from dust 
is evident at 10 pm. (Allen, D.A. (1975): Infrared, the New 
Astronomy (Keith Reid Ltd., Shaldon) p. 103) 

One of the strongest infrared sources in the 
sky is the nebula around the star t] Carinae. The 
nebula consists of ionised gas, but infrared radi¬ 
ation from dust is also clearly visible in its spec¬ 
trum (Fig. 16.13). In even more extreme cases, 
the central star may be completely obscured, 
but revealed by the infrared emission from hot 
dust. 

Composition and Origin of the Dust (Ta¬ 
bles 16.1 and 16.2). From the peaks in the ex¬ 
tinction curve, it may be concluded that inter¬ 
stellar dust contains water ice and silicates, and 
probably graphite as well. The sizes of the grains 
can be deduced from their scattering properties; 
usually they are smaller than one micrometre. 
The strongest scattering is due to grains of about 
0.3 pm but smaller particles must also be present. 

Dust grains are formed in the atmospheres of 
stars of late spectral types (K, M). Gas condenses 


into grains just as water in the Earth’s atmosphere 
may condense into snow and ice. The grains are 
then expelled into interstellar space by the ra¬ 
diation pressure. Grains may also form in con¬ 
nection with star formation and possibly directly 
from atoms and molecules in interstellar clouds 
as well. 


16.2 Interstellar Gas 

The mass of gas in interstellar space is a hundred 
times larger than that of dust. Although there is 
more gas, it is less easily observed, since the gas 
does not cause a general extinction of light. In the 
optical region it can only be observed on the basis 
of a small number of spectral lines. 

The existence of interstellar gas began to be 
suspected in the first decade of the 20th cen¬ 
tury, when in 1904 Johannes Hartmann observed 
that some absorption lines in the spectra of cer¬ 
tain binary stars were not Doppler shifted by 
the motions of the stars like the other lines. It 
was concluded that these absorption lines were 
formed in gas clouds in the space between the 
Earth and the stars. In some stars there were sev¬ 
eral lines, apparently formed in clouds moving 
with different velocities. The strongest lines in 
the visible region are those of neutral sodium 
and singly ionised calcium (Fig. 16.14). In the 
ultraviolet region, the lines are more numerous. 
The strongest one is the hydrogen Lyman a line 
(121.6 nm). 

On the basis of the optical and ultraviolet lines, 
it has been found that many atoms are ionised in 
interstellar space. This ionisation is mainly due 
to ultraviolet radiation from stars and, to some 
extent, to ionisation by cosmic rays. Since the 
density of interstellar matter is very low, the free 
electrons only rarely encounter ions, and the gas 
remains ionised. 

About thirty elements have been discovered by 
absorption line observations in the visible and ul¬ 
traviolet region. With a few exceptions, all ele¬ 
ments from hydrogen to zinc (atomic number 30) 
and a few additional heavier elements have been 
detected (Table 16.3). Like in the stars, most of 
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Table 16.1 Main 
properties of interstellar 
gas and dust 


Property 

Gas 

Dust 

Mass fraction 

10% 

0.1 % 

Composition 

HI, HU, H 2 (70 %) 

He (28 %) 

C, N, O, Ne, Na, 

Mg, Al, Si, S, ... (2 %) 

Solid particles 
dtv 0.1-1 pm 

H 2 0 (ice), silicates, 
graphite + impurities 

Particle density 

1 /cm 3 

10“ 13 /cm 3 = 100 /km 3 

Mass density 

10“ 21 kg/m 3 

10“ 23 kg/m 3 

Temperature 

100 K (HI), 10 4 K(HII) 

50 K (H 2 ) 

10-20 K 

Method of study 

Absorption lines in stellar spectra. 
Optical: 

Cal, Call, Nal, 

KI, Till, Fel, 

CN, CH, CH+ 

Ultraviolet: 

H 2 , CO, HD 

Radio lines: 

hydrogen 21 cm 
emission and absoiption; 

HII, Hell, CII 
recombination lines; 
molecular emission and 
absorption lines 

OH, H 2 CO, NH 3 ,H 2 0, 

CO, h 2 c 2 hcn, c 2 h 5 oh 

Absorption and scattering of 
starlight. 

Interstellar reddening 
Interstellar polarisation 
Thermal infrared emission 


the mass is hydrogen (about 70 %) and helium 
(almost 30 %). On the other hand, heavy elements 
are significantly less abundant than in the Sun and 
other population I stars. It is thought that they 
have been incorporated into dust grains, where 
they do not produce any absorption lines. The 
element abundances in the interstellar medium 
(gas + dust) would then be normal, although the 
interstellar gas is depleted in heavy elements. 
This interpretation is supported by the observa¬ 
tion that in regions where the amount of dust is 
smaller than usual, the element abundances in the 
gas are closer to normal. 

Atomic Hydrogen Ultraviolet observations 
have provided an excellent way of studying in¬ 
terstellar neutral hydrogen. The strongest inter¬ 
stellar absorption line, as has already been men¬ 
tioned, is the hydrogen Lyman a line (Fig. 16.15). 
This line corresponds to the transition of the elec¬ 
tron in the hydrogen atom from a state with prin¬ 
cipal quantum number n = 1 to one with n = 2. 


The conditions in interstellar space are such that 
almost all hydrogen atoms are in the ground 
state with n = 1. Therefore the Lyman a line is 
a strong absorption line, whereas the B aimer ab¬ 
sorption lines, which arise from the excited initial 
state n — 2, are unobservable. (The Balmer lines 
are strong in stellar atmospheres with tempera¬ 
tures of about 10,000 K, where a large number of 
atoms are in the first excited state.) 

The first observations of the interstellar Lyman 
a line were made from a rocket already in 1967. 
More extensive observations comprising 95 stars 
were obtained by the OAO 2 satellite. The dis¬ 
tances of the observed stars are between 100 and 
1000 parsecs. 

Comparison of the Lyman a observations with 
observations of the 21 cm neutral hydrogen line 
have been especially useful. The distribution of 
neutral hydrogen over the whole sky has been 
mapped by means of the 21 cm line. However, 
the distances to nearby hydrogen clouds are diffi¬ 
cult to determine from these observations. In the 
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Table 16.2 Phenomena caused by the interstellar medium 


Observable phenomenon 

Cause 

Interstellar extinction and polarisation 

Non-spherical dust grains aligned by magnetic field 

Dark nebulae, uneven distribution of stars and galaxies 

Dust clouds 

Interstellar absorption lines in stellar spectra 

Atoms and molecules in the interstellar gas 

Reflection nebulae 

Interstellar dust clouds illuminated by nearby stars 

Emission nebulae or HII regions (optical, infrared and 
radio emission) 

Interstellar gas and dust cloud, where a nearby hot star 
ionises the gas and heats the dust to 50-100 K 

Optical galactic background (diffuse galactic light) 

Interstellar dust illuminated by the integrated light of all 
stars 


Galactic background radiation: 


(a) short wavelength (< 1 m) 

Free-free emission from hot interstellar gas 

(b) long wavelength (> 1 m) 

Synchrotron radiation from cosmic ray electrons in the 
magnetic field 

Galactic 21 cm emission 

Cold (100 K) interstellar neutral hydrogen clouds (HI 
regions) 

Molecular line emission (extended) 

Giant molecular clouds (masses even 10 5 —10 6 Mg), 
dark nebulae 

Point-like OH, H 2 O and SiO sources 

Maser sources near protostars and long-period variables 


Fig. 16.14 (a) The D 

lines D\ and £>2 of 
interstellar sodium (rest 
wavelengths 589.89 and 
589.00 nm) in the spectrum 
of the star HD 14134. Both 
lines consist of two 
components formed in the 
gas clouds of two spiral 
arms. The radial velocity 
difference of the arms is 
about 30 km/s. (Mt. Wilson 
Observatory), (b) The 
interstellar absorption lines 
of ionised calcium Ca II 
and ionised methylidyne 
CH + in the spectra of 
several stars. The emission 
spectrum of iron is shown 
for comparison in (a) and 
(b). (Lick Observatory) 



Lyman a observations one usually knows the dis¬ 
tance to the star in front of which the absorbing 
clouds must lie. 

The average gas density within about 1 kpc 
of the Sun derived from the Lyman a observa¬ 
tions is 0.7 atoms/cm 3 . Because the interstellar 
Lyman a line is so strong, it can be observed even 


in the spectra of very nearby stars. For exam¬ 
ple, it has been detected by the Copernicus satel¬ 
lite in the spectrum of Arcturus, whose distance 
is only 11 parsecs. The deduced density of neu¬ 
tral hydrogen between the Sun and Arcturus is 
0.02-0.1 atoms/cm 3 . Thus the Sun is situated in 
a clearing in the interstellar medium, where the 
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Table 16.3 Element abundances in the interstellar 
medium towards f Ophiuchi and in the Sun. The abun¬ 
dances are given relative to that of hydrogen, which has 
been defined to be 1,000,000. An asterisk (*) means that 


the abundance has been determined from meteorites. The 
last column gives the ratio of the abundances in the inter¬ 
stellar medium and in the Sun 


Atomic number 

Name 

Chemical symbol 

Interstellar abundance 

Solar abundance 

Abundance ratio 

1 

Hydrogen 

H 

1,000,000 

1,000.000 

1.00 

2 

Helium 

He 

85,000 

85,000 

^1 

3 

Lithium 

Li 

0.000051 

0.00158* 

0.034 

4 

Beryllium 

Be 

r<0.000070 

0.000012 

n<5.8 

5 

Boron 

B 

0.000074 

0.0046* 

0.016 

6 

Carbon 

C 

74 

370 

0.20 

7 

Nitrogen 

N 

21 

110 

0.19 

8 

Oxygen 

O 

172 

660 

0.26 

9 

Fluorine 

F 

- 

0.040 

- 

10 

Neon 

Ne 

- 

83 

- 

11 

Sodium 

Na 

0.22 

1.7 

0.13 

12 

Magnesium 

Mg 

1.05 

35 

0.030 

13 

Aluminium 

A1 

0.0013 

2.5 

0.00052 

14 

Silicon 

Si 

0.81 

35 

0.023 

15 

Phosphorus 

P 

0.021 

0.27 

0.079 

16 

Sulfur 

S 

8.2 

16 

0.51 

17 

Chlorine 

Cl 

0.099 

0.45 

0.22 

18 

Argon 

Ar 

0.86 

4.5 

0.19 

19 

Potassium 

K 

0.010 

0.11 

0.094 

20 

Calcium 

Ca 

0.00046 

2.1 

0.00022 

21 

Scandium 

Sc 

- 

0.0017 

- 

22 

Titanium 

Ti 

0.00018 

0.055 

0.0032 

23 

Vanadium 

V 

P<0.0032 

0.013 

p0.25 

24 

Chromium 

Cr 

n<0.002 

0.50 

H<0.004 

25 

Manganese 

Mn 

0.014 

0.26 

0.055 

26 

Iron 

Fe 

0.28 

25 

0.011 

27 

Cobalt 

Co 

<0.19 

0.032 

p5.8 

28 

Nickel 

Ni 

0.0065 

1.3 

0.0050 

29 

Copper 

Cu 

0.00064 

0.028 

0.023 

30 

Zinc 

Zn 

0.014 

0.026 

0.53 


density is less than one tenth of the average den¬ 
sity. 

If a hydrogen atom in its ground state absorbs 
radiation with a wavelength smaller than 91.2 nm, 
it will be ionised. Knowing the density of neutral 
hydrogen, one can calculate the expected distance 
a 91.2 nm photon can propagate before being ab¬ 
sorbed in the ionisation of a hydrogen atom. Even 
in the close neighbourhood of the Sun, where the 


density is exceptionally low. the mean free path 
of a 91.2 nm photon is only about a parsec and 
that of a 10 nm photon a few hundred parsecs. 
Thus only the closest neighbourhood of the Sun 
can be studied in the extreme ultraviolet (XUV) 
spectral region. 

The Hydrogen 21 cm Line The spins of the 
electron and proton in the neutral hydrogen 
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Fig. 16.15 Interstellar absorption lines in the ultraviolet 
spectrum of f Ophiuchi. The strongest line is the hydro¬ 
gen Lyman a line (equivalent width, more than 1 nm). 


The observations were made with the Copernicus satellite. 
(Morton, D.C. (1975): Astrophys. J. 197, 85) 


atom in the ground state may be either paral¬ 
lel or opposite. The energy difference between 
these two states corresponds to the frequency of 
1420.4 MHz. Thus transitions between these two 
hyperfine structure energy levels will give rise to 
a spectral line at the wavelength of 21.049 cm 
(Fig. 5.8). The existence of the line was theo¬ 
retically predicted by Hendrick van de Hulst in 
1944, and was first observed by Harold Ewen 
and Edward Purcell in 1951. Studies of this line 
have revealed more about the properties of the 
interstellar medium than any other method—one 
might even speak of a special branch of 21 cm as¬ 
tronomy. The spiral structure and rotation of the 
Milky Way and other galaxies can also be studied 
by means of the 21 cm line. 

Usually the hydrogen 21 cm line occurs in 
emission. Because of the large abundance of hy¬ 
drogen, it can be observed in all directions in 
the sky. Some observed 21 cm line profiles are 
shown in Fig. 16.16. Rather than frequency or 
wavelength, the radial velocity calculated from 
the Doppler formula is plotted on the horizon¬ 
tal axis. This is because the broadening of the 
21 cm spectral line is always due to gas mo¬ 
tions either within the cloud (turbulence) or of the 
cloud as a whole. The vertical axis is mostly plot¬ 
ted in terms of the antenna temperature 7 a (see 
Chap. 5), the usual radio astronomical measure 
of intensity. The brightness temperature of an ex¬ 
tended source is then T\, — 7’\/f/B. where ;;b is 
the beam efficiency of the antenna. 

For the 21 cm line hv/k — 0.07 K, and thus 
hv/kT <<C 1 for all relevant temperatures. One 
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Fig. 16.16 Hydrogen 21 cm emission line profiles in the 
galactic plane at longitude 180°, 90° and 1° (in the direc¬ 
tion / = 0° there is strong absorption). The horizontal axis 
gives the radial velocity according to the Doppler formula, 
the vertical axis gives the brightness temperature. (Burton, 
W.B. (1974): “The Large Scale Distribution of Neutral 
Hydrogen in the Galaxy”, in Galactic and Extra-Galac¬ 
tic Radio Astronomy, ed. by Verschuur, G.L.. Kellermann, 
K.I. (Springer, Berlin, Heidelberg, New York) p. 91) 


may therefore use the Rayleigh-Jeans approxi¬ 
mation (5.24) 


Iv 


2v 2 kT 


(16.18) 


In the solution of the equation of radiative trans¬ 
fer (5.42) the intensity can thus be directly related 
to a corresponding temperature. By definition, 
I v is related to the brightness temperature 7b, and 
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the source function S v is related to the excitation 
temperature Zexc> i- e - 

Tb — 7e XC (l — e _Tv ). (16.19) 

In certain directions in the Milky Way there is 
so much hydrogen along the line of sight that the 
21 cm line is optically thick, r v 1. In that case 

?b = T exc , (16.20) 

i.e. the brightness temperature immediately yields 
the excitation temperature of the cloud. This is 
often referred to as the spin temperature 7s. 

The excitation temperature need not always 
agree with the kinetic temperature of the gas. 
However, in the present case the population num¬ 
bers of the hyperfine levels are determined by 
mutual collisions of hydrogen atoms: the time 
between collisions is 400 years on the average, 
whereas the time for spontaneous radiative transi¬ 
tions is 11 million years; thus the excitation tem¬ 
perature will be the same as the kinetic tempera¬ 
ture. The observed temperature is T ta 125 K. 

The distance to a source cannot be obtained 
directly from the observed emission. Thus one 
can only study the number of hydrogen atoms in 
a cylinder with a 1 cm 2 base area extending from 
the observer to outside the Milky Way along the 
line of sight. This is called the projected or col¬ 
umn density and is denoted by N. One may also 
consider the column density N(v)dv of atoms 
with velocities in the interval [v, v + du]. 

It can be shown that if the gas is optically 
thin, the brightness temperature in a spectral line 
is directly proportional to the column density N 
of atoms with the corresponding radial velocity. 
Hence, if the diameter 7 of a cloud along the line 
of sight is known, the gas density can be deter¬ 
mined from the observed line profile: 

n = N/L. 

The diameter L can be obtained from the ap¬ 
parent diameter, if the distance and shape of the 
cloud are assumed known. 

The distances of clouds can be determined 
from their radial velocities by making use of the 
rotation of the Milky Way (Sect. 18.3). Thus if 


the observed peaks in the 21 cm line profiles 
(Fig. 16.16) are due to individual clouds, their 
distances and densities can be obtained. Since 
radio observations are not affected by extinc¬ 
tion, it has been possible in this way to map the 
density distribution of neutral hydrogen in the 
whole galactic plane. The resulting distribution, 
based on observations at Leiden and Parkes, is 
shown in Fig. 16.17. It appears that the Milky 
Way is a spiral galaxy and that the interstel¬ 
lar hydrogen is concentrated in the spiral arms. 
The average density of interstellar hydrogen is 
1 atom/cm 3 , but the distribution is very inhomo¬ 
geneous. Typically the hydrogen forms denser re¬ 
gions, a few parsecs in size, where the densities 
may be 10-100 atoms/cm 3 . Regions where the 
hydrogen is predominantly neutral are known as 
HI regions (in contrast to HII regions of ionised 
hydrogen). 

The hydrogen 21 cm line may also occur in 
absorption, when the light from a bright radio 
source, e.g. a quasar, passes through an interven¬ 
ing cloud. The same cloud may give rise to both 
an absorption and an emission spectrum. In that 
case the temperature, optical thickness and hy¬ 
drogen content of the cloud can all be derived. 

Like interstellar dust hydrogen is concentrated 
in a thin disk in the galactic plane. The thickness 
of the hydrogen layer is about twice that of the 
dust or about 200 pc. 

HII Regions In many parts of space hydrogen 
does not occur as neutral atoms, but is ionised. 
This is true in particular around hot O stars, 
which radiate strongly in the ultraviolet. If there 
is enough hydrogen around such a star, it will 
be visible as an emission nebula of ionised hy¬ 
drogen. Such nebulae are known as HII region 
(Figs. 16.18 and 16.19). 

A typical emission nebula is the great nebula 
in Orion, M42. It is visible even to the unaided 
eye, and is a beautiful sight when seen through 
a telescope. In the middle of the nebula there is 
a group of four hot stars known as the Trapez¬ 
ium, which can be distinguished inside the bright 
nebula, even with a small telescope. The Trapez¬ 
ium stars emit strong ultraviolet radiation, which 
keeps the gas nebula ionised. 
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Fig. 16.17 The distribution of neutral hydrogen in the galaxy from the Leiden and Parkes surveys. The density is given 
in atoms/cm 3 . (Oort, J.H., Kerr, P.T., Westerhout, G.L. (1958): Mon. Not. R. Astron. Soc. 118. 379) 


Unlike a star a cloud of ionised gas has 
a spectrum dominated by a few narrow emission 
lines. The continuous spectrum of HII regions is 
weak. In the visible region the hydrogen B aimer 
emission lines are particularly strong. These are 
formed when a hydrogen atom recombines into 
an excited state and subsequently returns to the 
ground state via a sequence of radiative transi¬ 
tions. Typically a hydrogen atom in a HII re¬ 
gion remains ionised for several hundred years. 
Upon recombination it stays neutral for some 
months, before being ionised again by a photon 
from a nearby star. 

The number of recombinations per unit time 
and volume is proportional to the product of the 
densities of electrons and ions, 

(16.21) 


In completely ionised hydrogen, n e = n,. and 
hence 

M rec OC«e- (16.22) 

Most recombinations will include the transition 
n = 3 —> 2, i.e. will lead to the emission of a Hq, 
photon. Thus the surface brightness of a nebula in 
the Hq. line will be proportional to the emission 
measure. 



where the integral is along the line of sight 
through the nebula. 

The ionisation of a helium atom requires more 
energy than that of a hydrogen atom, and thus re¬ 
gions of ionised helium are formed only around 


n rec oc n t n\. 
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Fig. 16.18 The great 
nebula in Orion (M42, 
NGC 1976). The nebula 
gets its energy from newly 
formed hot stars. The dark 
regions are opaque dust 
clouds in front of the 
nebula. Radio and infrared 
observations have revealed 
a rich molecular cloud 
behind the nebula 
(Fig. 16.20). In the upper 
part of this picture is the 
gas nebula NGC 1977, in 
the lower part the bright 
star l Orionis. (Lick 
Observatory) 



the hottest stars. In these cases, a large HII region 
will surround a smaller central He + or He ++ re¬ 
gion. The helium lines will then be strong in the 
spectrum of the nebula. 

Although hydrogen and helium are the main 
constituents of clouds, their emission lines are 
not always strongest in the spectrum. At the be¬ 
ginning of this century it was even suggested 
that some strong unidentified lines in the spec¬ 
tra of nebulae were due to the new element neb- 
ulium. However, in 1927 Ira S. Bowen showed 
that they were forbidden lines of ionised oxygen 
and nitrogen, 0 + , 0 ++ and N + . Forbidden lines 
are extremely difficult to observe in the labora¬ 
tory, because their transition probabilities are so 
small that at laboratory densities the ions are de- 


excited by collisions before they have had time 
to radiate. In the extremely diffuse interstellar 
gas, collisions are much less frequent, and thus 
there is a chance that an excited ion will make 
the transition to a lower state by emitting a pho¬ 
ton. 

Because of interstellar extinction, only the 
nearest HII regions can be studied in visible light. 
At infrared and radio wavelengths much more 
distant regions can be studied. The most impor¬ 
tant lines at radio wavelengths are recombination 
lines of hydrogen and helium; thus the hydro¬ 
gen transition between energy levels 110 and 109 
at 5.01 GHz has been much studied. These lines 
are also important because with their help radial 
velocities, and hence (using the galactic rotation 
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Fig. 16.19 The Lagoon 
nebula (M8, NGC 6523) in 
Sagittarius. This HII 
region contains many stars 
of early spectral types and 
stars that are still 
contracting towards the 
main sequence. Small, 
round dark nebulae, 
globules, are also visible 
against the bright 
background. These are 
presumably gas clouds in 
the process of condensation 
into stars. (National 
Optical Astronomy 
Observatories, Kitt Peak 
National Observatory) 



law), distances of HII regions can be determined, 
just as for neutral hydrogen. 

The physical properties of HII regions can 
also be studied by means of their continuum radio 
emission. The radiation is due to bremsstrahlung 
or free-free emission from the electrons. The in¬ 
tensity of the radiation is proportional to the emis¬ 
sion measure EM defined in (16.23). HII regions 
also have a strong infrared continuum emission. 
This is thermal radiation from dust inside the neb¬ 
ula. 

HII regions are formed when a hot O or B star 
begins to ionise its surrounding gas. The ionisa¬ 
tion steadily propagates away from the star. Be¬ 
cause neutral hydrogen absorbs ultraviolet radia¬ 
tion so efficiently, the boundary between the HII 
region and the neutral gas is very sharp. In a ho¬ 
mogeneous medium the HII region around a sin¬ 
gle star will be spherical, forming a Stromgren 
sphere. For a BO V star the radius of the Strom¬ 
gren sphere is 50 pc and for an AO V star only 
1 pc. 

The temperature of a HII region is higher 
than that of the surrounding gas, and it therefore 
tends to expand. After millions of years, it will 
have become extremely diffuse and will eventu¬ 
ally merge with the general interstellar medium. 


16.3 Interstellar Molecules 

The first interstellar molecules were discov¬ 
ered in 1937-1938, when molecular absorp¬ 
tion lines were found in the spectra of some 
stars. Three simple diatomic molecules were de¬ 
tected: methylidyne CH, its positive ion CH + 
and cyanogen CN. A few other molecules were 
later discovered by the same method in the ul¬ 
traviolet. Thus molecular hydrogen H 2 was dis¬ 
covered in the early 1970’s, and carbon monox¬ 
ide, which had been discovered by radio ob¬ 
servations, was also detected in the ultraviolet. 
Molecular hydrogen is the most abundant in¬ 
terstellar molecule, followed by carbon monox¬ 
ide. 

Molecular Hydrogen The detection and study 
of molecular hydrogen has been one of the 
most important achievements of UV astronomy. 
Molecular hydrogen has a strong absorption band 
at 105 nm, which was first observed in a rocket 
experiment in 1970 by George R. Carruthers, but 
more extensive observations could only be made 
with the Copernicus satellite. The observations 
showed that a significant fraction of interstellar 
hydrogen is molecular, and that this fraction in¬ 
creases strongly for denser clouds with higher ex- 
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Fig. 16.20 Radio map of 
the distribution of carbon 
monoxide 13 C ls O in the 
molecular cloud near the 
Orion nebula. The curves 
are lines of constant 
intensity. (Kutner, M.L., 
Evans II, N.J., Tucker, 
K.D. (1976): Astrophys. J. 
209, 452) 



tinction. In clouds with visual extinction larger 
than one magnitude essentially all the hydrogen 
is molecular. 

Hydrogen molecules are formed on the surface 
of interstellar grains, which thus act as a chem¬ 
ical catalyst. Dust is also needed to shield the 
molecules from the stellar UV radiation, which 
would otherwise destroy them. Molecular hydro¬ 
gen is thus found where dust is abundant. It is of 
interest to know whether gas and dust are well 
mixed or whether they form separate clouds and 
condensations. 

UV observations have provided a reliable way 
of comparing the distribution of interstellar gas 
and dust. The amount of dust between the ob¬ 
server and a star is obtained from the extinction of 
the stellar light. Furthermore, the absorption lines 
of atomic and molecular hydrogen in the ultravi¬ 
olet spectrum of the same star can be observed. 
Thus the total amount of hydrogen (atomic + 
molecular) between the observer and the star can 
also be determined. 

Observations indicate that the gas and dust 
are well mixed. The amount of dust giving rise 
to one magnitude visual extinction corresponds 
to 1.9 x 10 21 hydrogen atoms (one molecule is 
counted as two atoms). The mass ratio of gas and 
dust obtained in this way is 100. 


Radio Spectroscopy Absorption lines can only 
be observed if there is a bright star behind the 
molecular cloud. Because of the large dust extinc¬ 
tion, no observations of molecules in the densest 
clouds can be made in the optical and ultravio¬ 
let spectral regions. Thus only radio observations 
are possible for these objects, where molecules 
are especially abundant. 

Radio spectroscopy signifies an immense step 
forward in the study of interstellar molecules. In 
the early 1960’s, it was still not believed that there 
might be more complicated molecules than di¬ 
atomic ones in interstellar space. It was thought 
that the gas was too diffuse for molecules to 
form and that any that formed would be destroyed 
by ultraviolet radiation. The first molecular radio 
line, the hydroxyl radical OH, was discovered in 
1963. Many other molecules have been discov¬ 
ered since then. By 2002, about 130 molecules 
had been detected, the heaviest one being the 
13-atom molecule HCnN. 

Molecular lines in the radio region may be ob¬ 
served either in absorption or in emission. Ra¬ 
diation from diatomic molecules like CO (see 
Fig. 16.20) may correspond to three kinds of 
transitions. (1) Electron transitions correspond to 
changes in the electron cloud of the molecule. 
These are like the transitions in single atoms, and 
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Table 16.4 Some molecules observed in the interstellar 
medium 


Molecule 

Name 

Year of discovery 

Discovered in the optical and ultraviolet region: 

CH 

methylidyne 

1937 

CH+ 

methylidyne ion 

1937 

CN 

cyanogen 

1938 

h 2 

hydrogen molecule 

1970 

CO 

carbon monoxide 

1971 


Discovered in the radio region: 


OH 

hydroxyl radical 

1963 

CO 

carbon monoxide 

1970 

CS 

carbon monosulfide 

1971 

SiO 

silicon monoxide 

1971 

SO 

sulfur monoxide 

1973 

h 2 o 

water 

1969 

HCN 

hydrogen cyanide 

1970 

nh 3 

ammonia 

1968 

h 2 co 

formaldehyde 

1969 

HCOOH 

formic acid 

1975 

HCCNC 

isocyanoacetylene 

1991 

c 2 h 4 o 

vinyl alcohol 

2001 

h 2 cccc 

cumulene carbene 

1991 

(ch 3 ) 2 o 

dimethyl ether 

1974 

c 2 h 5 oh 

ethanol 

1975 

HCnN 

cyanopentacetylene 

1981 


their wavelengths lie in the optical or ultraviolet 
region. (2) Vibrational transitions correspond to 
changes in the vibrational energy of the molecule. 
Their energies are usually in the infrared region. 
(3) Most important for radio spectroscopy are the 
rotational transitions, which are changes in the 
rotational energy of the molecule. Molecules in 
their ground state do not rotate, i.e. their angular 
momentum is zero, but they may be excited and 
start rotating in collisions with other molecules. 
For example, carbon sulfide CS returns to its 
ground state in a few hours by emitting a millime¬ 
tre region photon. 

A number of interstellar molecules are listed 
in Table 16.4. Many of them have only been de¬ 
tected in the densest clouds (mainly the Sagittar¬ 
ius B2 cloud at the galactic centre), but others are 
very common. The most abundant molecule H 2 
cannot be observed at radio wavelengths, because 


it has no suitable spectral lines. The next most 
abundant molecules are carbon monoxide CO, 
the hydroxyl radical OH and ammonia NH 3 , al¬ 
though their abundance is only a small fraction 
of that of hydrogen. However, the masses of in¬ 
terstellar clouds are so large that the number 
of molecules is still considerable. (The Sagittar¬ 
ius B2 cloud contains enough ethanol, C 2 H 5 OH, 
for 10 28 bottles of vodka.) 

Both the formation and survival of interstel¬ 
lar molecules requires a higher density than is 
common in interstellar clouds; thus they are most 
common in dense clouds. Molecules are formed 
in collisions of atoms or simpler molecules or 
catalysed on dust grains. Molecular clouds must 
also contain a lot of dust to absorb the ultravio¬ 
let radiation entering from outside that otherwise 
would disrupt the molecules. The most suitable 
conditions are thus found inside dust and molec¬ 
ular clouds near dense dark nebulae and HII re¬ 
gions. 

Most of the molecules in Table 16.4 have only 
been detected in dense molecular clouds occur¬ 
ring in connection with HII regions. Almost ev¬ 
ery molecule yet discovered has been detected in 
Sagittarius B2 near the galactic centre. Another 
very rich molecular cloud has been observed near 
the HII region Orion A. In visible light this re¬ 
gion has long been known as the Orion neb¬ 
ula M42 (Fig. 16.18). Inside the actual HII re¬ 
gions there are no molecules, since they would be 
rapidly dissociated by the high temperature and 
strong ultraviolet radiation. Three types of molec¬ 
ular sources have been found near HII regions 
(Fig. 16.21): 

1. Large gas and dust envelopes around the HII 

region. 

2. Small dense clouds inside these envelopes. 

3. Very compact OH and H 2 O maser sources. 

The large envelopes have been discovered pri¬ 
marily by CO observations. OH and H 2 CO have 
also been detected. Like in the dark nebulae the 
gas in these clouds is probably mainly molecular 
hydrogen. Because of the large size and density 
(n 10 3 — 10 4 molecules/cm 3 ) of these clouds, 
their masses are very large, 10 5 or even 10 6 solar 
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Table 16.5 The five 
phases of interstellar gas 




T 

[K] 

n 

[cm -3 ] 

1 . 

Very cold molecular gas clouds (mostly hydrogen H 2 ) 

20 

>10 3 

2. 

Cold gas clouds (mostly atomic neutral hydrogen) 

100 

20 

3. 

Warm neutral gas enveloping the cooler clouds 

6000 

0.05-0.3 

4. 

Hot ionised gas (mainly HII regions around hot stars) 

8000 

>0.5 

5. 

Very hot and diffuse ionised coronal gas, ionised and 
heated by supernova explosions 

10 6 

10“ 3 



Right ascension 5 h 32 min 

Fig. 16.21 Infrared map of the central part of the Orion 
nebula. In the lower part are the four Trapezium stars. 
Above is an infrared source of about 0.5" diameter, the 
Kleinmann-Low nebula (KL). BN is an infrared point 
source, the Becklin-Neugebauer object. Other infrared 
sources are denoted IRS. The large crosses indicate OH 
masers and the small crosses H 2 O masers. On the scale 
of Fig. 16.18 this region would only be a few millimetres 
in size. (Goudis, C. (1982): The Orion Complex: A Case 
Study of Interstellar Matter (Reidel, Dordrecht) p. 176) 

masses (SgrB2). They are among the most mas¬ 
sive objects in the Milky Way. The dust in molec¬ 
ular clouds can be observed on the basis of its 
thermal radiation. Its peak falls at wavelengths of 
10-100 pm, corresponding to a dust temperature 
of 30-300K. 

Some interstellar clouds contain very small 
maser sources. In these the emission lines of OH, 


HtO and SiO may be many million times stronger 
than elsewhere. The diameter of the radiating re¬ 
gions is only about 5-10 au. The conditions in 
these clouds are such that radiation in some spec¬ 
tral lines is amplified by stimulated emission as it 
propagates through the cloud. Hydroxyl and wa¬ 
ter masers occur in connection with dense HII 
regions and infrared sources, and appear to be re¬ 
lated to the formation of protostars. In addition 
maser emission (OH, H 2 O and SiO) occurs in 
connection with Mira variables and some red su¬ 
pergiant stars. This maser emission comes from 
a molecule and dust envelope around the star, 
which also gives rise to an observable infrared ex¬ 
cess. 


16.4 The Formation of Protostars 

The mass of the Milky Way is about 10 11 solar 
masses. Since its age is about 10 10 years, stars 
have been forming at the average rate of 10 Mq 
per year. This estimate is only an upper limit for 
the present rate, because earlier the rate of star 
formation must have been much higher. Since 
the lifetime of O stars is only about a million 
years, a better estimate of the star formation rate 
can be made, based on the observed number of 
O stars. Accordingly, it has been concluded that at 
present, new stars are forming in the Milky Way 
at a rate of about three solar masses per year. 

Stars are now believed to form inside large 
dense interstellar clouds mostly located in the 
spiral arms of the Galaxy. Under its own grav¬ 
ity, a cloud begins to contract and fragment into 
parts that will become protostars. The observa¬ 
tions seem to indicate that stars are not formed 
individually, but in larger groups. Young stars are 
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found in open clusters and in loose associations, 
typically containing a few hundred stars which 
must have formed simultaneously. 

Theoretical calculations confirm that the for¬ 
mation of single stars is almost impossible. An 
interstellar cloud can contract only if its mass is 
large enough for gravity to overwhelm the pres¬ 
sure. As early as in the 1920’s, James Jeans cal¬ 
culated that a cloud with a certain temperature 
and density can condense only if its mass is high 
enough. If the mass is too small the pressure of 
the gas is sufficient to prevent the gravitational 
contraction. The limiting mass is the Jeans mass 
(Sect. 6.11), given by 


Mj 



where n is the density in atoms/m 3 and 7 the tem¬ 
perature. 

In a typical interstellar neutral hydrogen cloud 
n — 10 6 and T — 100 K, giving the Jeans mass, 
30,000 Mq. In the densest dark clouds n = 10 12 
and T — 10 K and hence, Mj = I Mq. 

It is thought that star formation begins in 
clouds of a few thousand solar masses and diam¬ 
eters of about 10 pc. The cloud begins to contract, 
but does not heat up because the liberated energy 
is carried away by radiation. As the density in¬ 
creases. the Jeans mass thus decreases. Because 
of this, separate condensation nuclei are formed 
in the cloud, which go on contracting indepen¬ 
dently: the cloud fragments. Fragmentation is fur¬ 
ther advanced by the increasing rotation velocity. 
The original cloud has a certain angular momen¬ 
tum which is conserved during the contraction; 
thus the angular velocity must increase. 

This contraction and fragmentation continues 
until the density becomes so high that the individ¬ 
ual fragments become optically thick. The energy 
liberated by the contraction can then no longer 
escape, and the temperature will begin to rise. In 
consequence the Jeans mass begins to increase, 
further fragmentation ceases and the rising pres¬ 
sure in existing fragments stops their contrac¬ 
tion. Some of the protostars formed in this way 
may still be rotating too rapidly. These may split 


into two, thus forming double systems. The fur¬ 
ther evolution of protostars has been described in 
Sect. 11.2. 

Although the view that stars are formed by 
the collapse of interstellar clouds is generally ac¬ 
cepted, many details of the fragmentation process 
are still highly conjectural. Thus the effects of ro¬ 
tation, magnetic fields and energy input are very 
imperfectly known. Why a cloud begins to con¬ 
tract is also not certain; one theory is that passage 
through a spiral arm compresses clouds and trig¬ 
gers contraction (see Sect. 17.4). This would ex¬ 
plain why young stars are predominantly found in 
the spiral arms of the Milky Way and other galax¬ 
ies. The contraction of an interstellar cloud might 
also be initiated by a nearby expanding HII re¬ 
gion or supernova explosion. 

Star formation can be observed particularly 
well in the infrared, since the temperatures of the 
condensing clouds and protostars are of the or¬ 
der 100-1000 K and the infrared radiation can 
escape even the densest dust clouds. For exam¬ 
ple, in connection with the Orion nebula there is 
a large cloud of hydrogen, found in radio obser¬ 
vations, containing small infrared sources. E.g. 
the Becklin-Neugebauer object has a temperature 
of a couple of hundred kelvins but a luminosity 
that is thousandfold compared with the Sun. It is 
a strong FFO maser source, located next to a large 
HII region. 


16.5 Planetary Nebulae 

Bright regions of ionised gas do not occur only 
in connection with newly formed stars, but also 
around stars in late stages of their evolution. 
The planetary nebulae are gas shells around 
small hot blue stars. As we have seen in con¬ 
nection with stellar evolution, instabilities may 
develop at the stage of helium burning. Some 
stars begin to pulsate, while in others the whole 
outer atmosphere may be violently ejected into 
space. In the latter case, a gas shell expanding at 
20-30 km/s will be formed around a small and 
hot (50,000-100,000 K) star, the core of the orig¬ 
inal star. 

The expanding gas in a planetary nebula is 
ionised by ultraviolet radiation from the central 
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Fig. 16.22 The Helix 
nebula (NGC 7293). The 
planetary nebulae are 
formed during the final 
stages of evolution of 
solar-type stars. The 
centrally visible star has 
ejected its outer layers into 
space. (National Optical 
Astronomy Observatories, 
Kitt Peak National 
Observatory) 



star, and its spectrum contains many of the same 
bright emission lines as that of an HII region. 
Planetary nebulae are, however, generally much 
more symmetrical in shape than most HII re¬ 
gions, and they expand more rapidly. For exam¬ 
ple, the well-known Ring nebula in Lyra (M57) 
has expanded visibly in photographs taken at 
50-year intervals. In a few ten thousand years, the 
planetary nebulae disappear in the general inter¬ 
stellar medium and their central stars cool to be¬ 
come white dwarfs. 

The planetary nebulae were given their name 
in the 19th century, because certain small nebulae 
visually look quite like planets such as Uranus. 
The apparent diameter of the smallest known 
planetary nebulae is only a few arc seconds, 
whereas the largest ones (like the Helix nebula) 
may be one degree in diameter (Fig. 16.22). 

The brightest emission lines are often due to 
forbidden transitions, like in HII regions. For ex¬ 
ample, the green colour of the central parts of 
the Ring nebula in Lyra is due to the forbid¬ 
den lines of doubly ionised oxygen at 495.9 and 
500.7 nm. The red colour of the outer parts is due 
to the hydrogen Balmer a line (656.3 nm) and 
the forbidden lines of ionised nitrogen (654.8 nm, 
658.3 nm). 


The total number of planetary nebulae in the 
Milky Way has been estimated to be 50,000. 
About 2000 have actually been observed. 


16.6 Supernova Remnants 

In Chap. 11 we have seen that massive stars end 
their evolution in a supernova explosion. The col¬ 
lapse of the stellar core leads to the violent ejec¬ 
tion of the outer layers, which then remain as an 
expanding gas cloud. 

About 120 supernova remnants (SNR’s) have 
been discovered in the Milky Way. Some of them 
are optically visible as a ring or an irregular neb¬ 
ula (e.g. the Crab nebula; see Fig. 16.23), but 
most are detectable only in the radio region (be¬ 
cause radio emission suffers no extinction). 

In the radio region the SNR’s are extended 
sources similar to HII regions. However, unlike 
HII regions the radiation from SNR’s is often 
polarised. Another characteristic difference be¬ 
tween these two kinds of sources is that whereas 
the radio brightness of HII regions grows or re¬ 
mains constant as the frequency increases, that of 
SNR’s falls off almost linearly (in a log I v — log v 
diagram) with increasing frequency (Fig. 16.24). 

These differences are due to the different 
emission processes in HII regions and in SNR’s. 
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Fig. 16.23 The Crab 
nebula (Ml, NGC 1952) in 
Taurus is the remnant of 
a supernova explosion 
observed in 1054. The 
photograph was taken at 
red wavelengths. The 
nebula is also a strong 
radio source. Its energy 
source is the central rapidly 
rotating neutron star, 
pulsar, which is the 
collapsed core of the 
original star. (Palomar 
Observatory) 




Fig. 16.24 The radio spectra of typical H 11 regions and 
supernova remnants. The radiation of HII regions is ther¬ 
mal and obeys the Rayleigh-Jeans law, I <x v 2 , at wave¬ 
lengths larger than 1 m. In supernova remnants the in¬ 
tensity decreases with increasing frequency. (After Schef- 
fler, H., Elsasser, H. (1987): Physics of the Galaxy and 
the Interstellar Matter (Springer, Berlin, Heidelberg, New 
York)) 

In an HII region, the radio emission is free-free 
radiation from the hot plasma. In a SNR it is 
synchrotron radiation from relativistic electrons 
moving in spiral orbits around the magnetic field 


lines. The synchrotron process gives rise to a con¬ 
tinuous spectrum extending over all wavelength 
regions. For example, the Crab nebula looks blue 
or green in colour photographs because of optical 
synchrotron radiation. 

In the Crab nebula red filaments are also visi¬ 
ble against the bright background. Their emission 
is principally in the hydrogen H (y line. The hy¬ 
drogen in a SNR is not ionised by a central star 
as in the HII regions, but by the ultraviolet syn¬ 
chrotron radiation. 

The supernova remnants in the Milky Way fall 
into two classes. One type has a clearly ring-like 
structure (e.g. Cassiopeia A or the Veil nebula in 
Cygnus; see Fig. 16.25); another is irregular and 
bright at the middle (like the Crab nebula). In the 
remnants of the Crab nebula type there is always 
a rapidly rotating pulsar at the centre. This pulsar 
provides most of the energy of the remnant by 
continuously injecting relativistic electrons into 
the cloud. The evolution of this type of SNR re¬ 
flects that of the pulsar and for this reason has 
a time scale of a few ten thousand years. 

Ring-like SNR’s do not contain an energetic 
pulsar; their energy comes from the actual super¬ 
nova explosion. After the explosion, the cloud ex¬ 
pands at a speed of 10,000-20,000 km/s. About 
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Fig. 16.25 The Veil 
nebula (NGC 6960 at the 
right , NGC 6992 at the left) 
in Cygnus is the remnant of 
a supernova explosion 
which occurred several ten 
thousand years ago. (Mt. 
Wilson Observatory) 



50-100 years after the explosion the remnant be¬ 
gins to form a spherical shell as the ejected gas 
starts to sweep up interstellar gas and to slow 
down in its outer parts. The swept-up shell ex¬ 
pands with a decreasing velocity and cools until, 
after about 100,000 years, it merges into the inter¬ 
stellar medium. The two types of supernova rem¬ 
nants may be related to the two types (I and II) of 
supernovae. 


Box 16.1 (Synchrotron Radiation) Synchro¬ 
tron radiation was first observed in 1948 by 
Frank Elder , Robert Langmuir and Herbert 
Pollack, who were experimenting with an elec¬ 
tron synchrotron, in which electrons were ac¬ 
celerated to relativistic energies in a magnetic 


field. It was observed that the electrons radi¬ 
ated visible light in a narrow cone along their 
momentary direction of motion. In astrophysics 
synchrotron radiation was first invoked as an 
explanation of the radio emission of the Milky 
Way, discovered by Karl Jansky in 1931. This 
radiation had a spectrum and a large metre- 
wave brightness temperature (more than 10 5 K) 
which were inconsistent with ordinary ther¬ 
mal free-free emission from ionised gas. In 
1950 Hannes A Ifven and Nicolai Herlofson as 
well as Karl-Otto Kiepenheuer proposed that 
the galactic radio background was due to syn¬ 
chrotron radiation. According to Kiepenheuer 
the high-energy cosmic ray electrons would 
emit radio radiation in the weak galactic mag- 
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netic field. This explanation has turned out to 
be correct. Synchrotron radiation is also an 
important emission process in supernova rem¬ 
nants, radio galaxies and quasars. It is a non- 
thermal radiation process, i.e. the energy of the 
radiating electrons is not due to thermal mo¬ 
tions. 



The emission of synchrotron radiation. A charged par¬ 
ticle (electron) propagating in a magnetic field moves 
in a spiral. Because of the centripetal acceleration, the 
particle emits electromagnetic radiation 

The origin of synchrotron radiation is 
schematically shown in the figure. The mag¬ 
netic field forces the electron to move in a spi¬ 
ral orbit. The electron is thus constantly accel¬ 
erated and will emit electromagnetic radiation. 
According to the special theory of relativity, 
the emission from a relativistic electron will be 
concentrated in a narrow cone. Like the beam 
from a lighthouse, this cone sweeps across the 
observer’s field of vision once for each revo¬ 
lution. Thus the observer sees a sequence of 
radiation flashes of very short duration com¬ 
pared with their interval. (In the total emis¬ 
sion of a large number of electrons, separate 
flashes cannot be distinguished.) When this se¬ 
ries of pulses is represented as a sum of differ¬ 
ent frequency components (Fourier transform), 
a broad spectrum is obtained with a maximum 
at 

fmax = ClB±E~, 

where B± is the magnetic field component per¬ 
pendicular to the velocity of the electron, and E 
its energy, a is a constant of proportionality. 


The table gives the frequency and wave¬ 
length of the maximum as functions of the 
electron energy for the typical galactic field 
strength 0.5 nT: 


^max 

^max [Hz] 

E [eV] 

300 nm 

10 15 

6.6 x 10 12 

30 |jm 

10 13 

6.6 x 10 u 

3 mm 

10 u 

6.6 x 10'° 

30 cm 

10 9 

6.6 x 10 9 

30 m 

10 7 

6.6 x 10 8 


To produce even radio synchrotron radia¬ 
tion, very energetic electrons are required, but 
these are known to be present in the cosmic ra¬ 
diation. In the optical galactic background radi¬ 
ation, the contribution from synchrotron radia¬ 
tion is negligible, but, for example, in the Crab 
nebula, a significant part of the optical emission 
is due to this mechanism. 


16.7 The Hot Corona of the Milky 
Way 

As early as 1956 Lyman Spitzer showed that the 
Milky Way has to be surrounded by a large en¬ 
velope of very hot gas (Fig. 16.26). Almost two 
decades later the Copernicus satellite, whose sci¬ 
entific program was directed by Spitzer, found 
evidence for this kind of gas, which began to 
be called galactic coronal gas, in analogy with 
the solar corona. The satellite observed emis¬ 
sion lines of e.g. five times ionised oxygen 
(OVI), four times ionised nitrogen (NV) and 
triply ionised carbon (CIV). The formation of 
these lines requires a high temperature (100,000- 
1,000,000 K), and a high temperature is also in¬ 
dicated by the broadening of the lines. 

Galactic coronal gas is distributed through 
the whole Milky Way and extends several thou¬ 
sand parsecs from the galactic plane. Its den¬ 
sity is only of the order of 10 -3 atoms/cm 3 (re¬ 
call that the mean density in the galactic plane 
is 1 atom/cm 3 ). Thus coronal gas forms a kind 
of background sea, from which the denser and 
cooler forms of interstellar matter, such as neu¬ 
tral hydrogen and molecular clouds, rise as is¬ 
lands. In the early 1980’s the IUE satellite also 
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Fig. 16.26 Galaxy’s hot corona. NGC 2403 is a spi¬ 
ral galaxy, similar to our Milky Way. On the right, it 
is photographed in visual light. On the left, in a VLA 
radio image, on the same scale with the optical image, 

detected similar coronae in the Large Magellanic 
Cloud and in the spiral galaxy Ml00. Coronal gas 
is probably quite a common and important form 
of matter in galaxies. 

Supernova explosions are probably the source 
of both coronal gas and its energy. When a super¬ 
nova explodes, it forms a hot bubble in the sur¬ 
rounding medium. The bubbles from neighbour¬ 
ing supernovae will expand and merge, forming 
a foamlike structure. In addition to supernovae, 
stellar winds from hot stars may provide some of 
the energy of the coronal gas. 


16.8 Cosmic Rays and the 

Interstellar Magnetic Field 

Cosmic Rays Elementary particles and atomic 
nuclei reaching the Earth from space are called 
cosmic rays. They occur throughout interstellar 
space with an energy density of the same order 
of magnitude as that of the radiation from stars. 
Cosmic rays are therefore important for the ioni¬ 
sation and heating of interstellar gas. 

Since cosmic rays are charged, their direction 
of propagation in space is constantly changed 
by the magnetic field. Their direction of arrival 


a large hydrogen corona is seen around the galaxy. Large 
holes created by supernova explosions are seen in the gas 
corona. (Image NRAO/AUI and Tom Oosterloo, Astron, 
The Netherlands) 

therefore gives no information about their place 
of origin. The most important properties of cos¬ 
mic rays that can be observed from the Earth 
are their particle composition and energy distri¬ 
bution. As noted in Sect. 3.6, these observations 
have to be made in the upper atmosphere or from 
satellites, since cosmic ray particles are destroyed 
in the atmosphere. 

The main constituent of the cosmic rays (about 
90 %) is hydrogen nuclei or protons. The second 
most important constituent (about 9 %) is helium 
nuclei or a particles. The rest of the particles are 
electrons and nuclei more massive than helium. 

Most cosmic rays have an energy smaller than 
10 9 eV. The number of more energetic particles 
drops rapidly with increasing energy. The most 
energetic protons have an energy of 10 2 ° eV, but 
such particles are very rare—the energy of one 
such proton could lift this book about one cen¬ 
timetre. (The largest particle accelerators reach 
“only” energies of 10 12 eV.) 

The distribution of low-energy (less than 
10 8 eV) cosmic rays cannot be reliably deter¬ 
mined from the Earth, since solar “cosmic rays”, 
high-energy protons and electrons formed in so¬ 
lar flares fill the solar system and strongly affect 
the motion of low-energy cosmic rays. 
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Fig. 16.27 The polarisation of starlight. The dashes give 
the direction and degree of the polarisation. The thinner 
dashes correspond to stars with polarisation smaller than 
0.6 %; the thicker dashes to stars with larger polarisation. 

The distribution of cosmic rays in the Milky 
Way can be directly inferred from gamma-ray and 
radio observations. The collisions of cosmic ray 
protons with interstellar hydrogen atoms gives 
rise to pions which then decay to form a gamma- 
ray background. The radio background is formed 
by cosmic ray electrons which emit synchrotron 
radiation in the interstellar magnetic field. 

Both radio and gamma-ray emission are stron¬ 
gly concentrated in the galactic plane. From this 
it has been concluded that the sources of cosmic 
rays must also be located in the galactic plane. In 
addition there are individual peaks in the back¬ 
grounds around known supernova remnants. In 
the gamma-ray region such peaks are observed at 
e.g. the Crab nebula and the Vela pulsar; in the ra¬ 
dio region the North Polar Spur is a large, nearby 
ring-like region of enhanced emission. 

Apparently a large fraction of cosmic rays 
have their origin in supernovae. An actual super¬ 
nova explosion will give rise to energetic parti¬ 
cles. If a pulsar is formed, observations show that 
it will accelerate particles in its surroundings. Fi¬ 
nally the shock waves formed in the expanding 
supernova remnant will also give rise to relativis¬ 
tic particles. 

On the basis of the relative abundances of var¬ 
ious cosmic ray nuclei, it can be calculated how 
far they have travelled before reaching the Earth. 
It has been found that typical cosmic ray protons 


The scale is shown in the upper left-hand corner. Stars 
with polarisation smaller than 0.08 % are indicated by 
a small circle. (Mathewson, D.S., Ford, V.L. (1970): Mem. 
R.A.S. 74, 139) 

have travelled for a period of a few million years 
(and hence also a distance of a few million light- 
years) from their point of origin. Since the diame¬ 
ter of the Milky Way is about 100,000 light-years, 
the protons have crossed the Milky Way tens of 
times in the galactic field. 

The Interstellar Magnetic Field The strength 
and direction of the interstellar magnetic field are 
difficult to determine reliably. Direct measure¬ 
ments are impossible, since the magnetic fields 
of the Earth and the Sun are much stronger. How¬ 
ever, using various sources it has been possible to 
deduce the existence and strength of the field. 

We have already seen that interstellar grains 
give rise to interstellar polarisation. In order to 
polarise light, the dust grains have to be similarly 
oriented; this can only be achieved by a general 
magnetic field. Figure 16.27 shows the distribu¬ 
tion of interstellar polarisation over the sky. Stars 
near each other generally have the same polarisa¬ 
tion. At low galactic latitudes the polarisation is 
almost parallel to the galactic plane, except where 
one is looking along a spiral arm. 

More precise estimates of the strength of the 
magnetic field can be obtained from the rotation 
of the plane of polarisation of the radio radia¬ 
tion from distant sources. This Faraday rotation is 
proportional to the strength of the magnetic field 
and to the electron density. Another method is to 
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measure the Zeeman splitting of the 21 cm radio 
line. These measurements have fairly consistently 
given a value of 10 -10 -10 -9 T for the strength of 
the interstellar magnetic field. This is about one 
millionth of the interplanetary field in the solar 
system. 


16.9 Examples 

Example 16.1 Estimate the dust grain size and 
number density in the galactic plane. 

Let us compare the interstellar extinction 
curve in Fig. 16.4(b) with the Mie scattering 
curves in Fig. 16.3. We see that the leftmost 
parts of the curves may correspond to each 
other: the interval 0 < x < 5 with m = 1.5 in 
Fig. 16.3 matches the interval 0 < 1/A < 5 pm -1 
in Fig. 16.4(b). Remembering that x — 2na/X, 
this suggests a constant grain radius a, given by 
2ita 1 pm, or a ss 0.16 pm. 

In the blue wavelength region (A = 0.44 pm), 
x = 2.3 and, according to the upper Fig. 16.3, 
<2 ext ^ 2. Using A = 2 mag for the interstellar ex¬ 
tinction at r — 1 kpc, we get, substituting (16.5) 
into (16.7), n 4 x 10 -7 m -3 . This should give 
the order of magnitude of the interstellar dust 
density. 

As a summary we could say that a consider¬ 
able fraction of the interstellar extinction might 
be due to grains of diameter 0.3 pm and particle 
density of the order of 10“ 7 m -3 = 100 km -3 . 

Example 16.2 Estimate the time interval be¬ 
tween successive collisions of a hydrogen atom 
in interstellar gas. 

Two atoms will collide if the separation be¬ 
tween their centres becomes less than 2 r, where 
r is the radius of the atom. Thus, the microscopic 
cross section for the collision is ct = n(2r) 2 — 
4jtr 2 . The macroscopic cross section, or the num¬ 
ber of collisions of an H atom per unit length, is 
then E = no , where n is the number density of 
the FI atoms. The mean free path / of an atom 
is the inverse of the macroscopic cross section, 
/ = I / E, and the time between two collisions is 
t = l/v, where i> is the velocity of the atom. 

Considering the numerical values, the Bohr ra¬ 
dius of an H atom is r = 5.3 x 10 -11 m. Taking 


n = 1 cm -3 we get / = 2.8 x 10 13 m « 0.0009 pc. 
The average velocity is not far from the root mean 
square velocity at T = 125 K, given by (5.33): 

v = = 1760 ms -1 . 

V m 

These values of / and v give t = l/v — 510 years 
for the collision interval. Taking into account the 
velocity distribution in the gas, the mean free path 
appears to be shorter by a factor of 1 / V 2 , which 
reduces the time to about 400 years. 

Example 16.3 Consider the lowest rotational 
transition of the CO molecule. For 12 CO the fre¬ 
quency of this line is u( 12 CO) = 115.27 GFIz, 
and for 13 CO, v( 13 CO) = 110.20 GHz. Estimate 
the optical thickness of each line in a molec¬ 
ular cloud, where the observed brightness tem¬ 
peratures of the lines are 7b( 12 CO) = 40 K and 
T b ( 13 CO) = 9 K. 

For the 12 CO line, hv/k = 5.5 K. Thus, the 
Rayleigh-Jeans approximation is valid if the tem¬ 
perature is considerably higher than 5 K. This is 
not always the case, but the measured value of 
T b ( 12 CO) suggests that the approximation can be 
used. 

Ignoring the background, (16.19) gives 
7b = T exc (1 — e T "). 

The optical thickness x v is proportional to the 
opacity or the absorption coefficient a v [see 
(4.16)], and a v is evidently proportional to the 
number of CO molecules present. Other differ¬ 
ences between the lines are small, so we can write 

Ty ( 12 CO) _ n( 12 CO) 

Ty( 13 CO) ~ n( 13 CO) 

Adopting the terrestrial value n( 12 CO)/n( 13 CO) 
= 89, we set 

r u ( 12 CO) = 89ry( 13 CO). 

Assuming the excitation temperatures equal and 
denoting Ty( 12 CO) by r, we get 

7exc(l — e -T ) = 40, 

7exc(l ~ e -T / 89 ) = 9. 
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The solution of this pair of equations is 

r„( 12 CO) =23, r y ( 13 CO) = 0.25, 
r exc = 40 K. 

Thus, the 12 CO line seems to be optically thick, 
and 7’e X c = 7b( 12 CO). If also the 13 CO line 
were optically thick, the brightness temperatures 
would be practically equal, and the optical thick¬ 
nesses could not be determined. 


16.10 Exercises 

Exercise 16.1 Two open clusters, which are 
seen near each other in the galactic plane, have 
angular diameters a and 3a, and distance mod¬ 
uli 16.0 and 11.0, respectively. Assuming their 


actual diameters are equal, find their distances 
and the interstellar extinction coefficient a in 
(16.4). 

Exercise 16.2 Estimate the free fall velocity on 
the surface of a spherical gas cloud contracting 
under the influence of its own gravity. Assume 
n( H 2 ) = 10 3 cm -3 and R = 5 pc. 

Exercise 16.3 The force F exerted by a mag¬ 
netic field B on a charge q moving with veloc¬ 
ity v is F — qv x B. If v is perpendicular to B, 
the path of the charge is circular. Find the radius 
of the path of an interstellar proton with a kinetic 
energy of 1 MeV. Use B = 0.1 nT for the galactic 
magnetic field. 
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Several collections of stars can be picked out in 
the sky, even with the naked eye. Closer study re¬ 
veals that they really do form separate clusters in 
space. E.g. the Pleiades in Taurus and the Hyades 
around Aldebaran, the brightest star in Taurus, 
are such open star clusters. Almost the whole of 
the constellation Coma Berenices is also an open 
star cluster. Many objects appearing as nebulous 
patches to the unaided eye, when looked at with 
a telescope, turn out to be star clusters, like Prae- 
sepe in the constellation Cancer, or double cluster 
in Perseus (Fig. 17.1). In addition to open clusters 
some apparently nebulous objects are very dense 
globular clusters, such as those in Hercules and 
in Canes Venatici (Fig. 17.2). 

The first catalogue of star clusters was pre¬ 
pared by the French astronomer Charles Messier 
in 1784. The first version contained only 45 
objects, but later Messier himself and Pierre 
Mechain expanded the catalogue to include 103 
objects. Later seven more objects were added, 
possibly observed by Messier. Messier was inter¬ 
ested in comets, not in those fuzzy objects, and 
the reason for the catalogue was to avoid false 
alarms. The catalogue contains a mixture of very 
different objects, like about 30 globular clusters 
and the same number of open clusters, gas nebu¬ 
lae and galaxies. 

A larger catalogue, published in 1888, was the 
New General Catalogue of Nebulae and Clus¬ 
ters of Stars prepared by the Danish astronomer 
John Louis Emil Dreyer. The catalogue numbers 
of objects in this list are preceded by the initials 
NGC. For example, the large globular cluster in 


Hercules is object M13 in the Messier catalogue, 
and it is also known as NGC 6205. The NGC cat¬ 
alogue was supplemented with the Index Cata¬ 
logue in 1895 and 1910. The objects of this cata¬ 
logue are given the initials IC. 

With a small telescope stars of distant clus¬ 
ters and galaxies cannot be distinguished as sep¬ 
arate objects; instead the target looks nebulous. 
Therefore their true nature was only little by lit¬ 
tle with spectroscopy and large telescopes. That’s 
why those old catalogues contain all kinds of dif¬ 
ferent objects. 

Even now there is no catalogue of open clus¬ 
ter complete to some limiting magnitude. The 
data are collected from several catalogues pub¬ 
lished by various astronomers, and therefore the 
nomenclature is not consistent. The problem is 
that open clusters concentrate close the plane of 
Milky Way, and there are a lot of background 
stars. Thus identifying the cluster members may 
require rather detailed studies of the properties of 
the stars. Also the members may already have dis¬ 
persed to a very wide area making it difficult to 
decide which of them belong to the same clus¬ 
ter. 


17.1 Associations 

In 1947 the Soviet astronomer Viktor Amaza- 
spovich Ambartsumyan discovered that there are 
groups of young stars scattered over so large re¬ 
gions of the sky that they would be very difficult 
to identify merely on the basis of their appear¬ 
ance. These associations may have a few tens of 
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members. One association is found around the 
star £ Persei, and in the region of Orion, there are 
several associations. 

Associations are groups of very young stars. 
They are usually identified on the basis either 
of absolutely bright main sequence stars or of 
TTauri stars. According to the type, one speaks 
of OB associations and T Tauri associations. The 
most massive stars of spectral class O stay on the 
main sequence for only a few million years, and 
therefore associations containing them are neces¬ 
sarily young. The TTauri stars are even younger 
stars that are in the process of contracting towards 
the main sequence. 

Studies of the internal motions in associations 
show that they are rapidly dispersing. There are 


so few stars in an association that their gravity 
cannot hold them together for any length of time. 
The observed motions have often confirmed that 
the stars in an association were very close to¬ 
gether a few million years ago (Fig. 17.3). 

Large amounts of interstellar matter, gas and 
dust nebulae often occur in connection with asso¬ 
ciations, supplying information about the connec¬ 
tion between star formation and the interstellar 
medium. Infrared observations have shown that 
stars are now forming or have recently formed in 
many dense interstellar clouds. 

Associations are strongly concentrated in the 
spiral arms in the plane of the Milky Way. Both in 
the Orion region and in the direction of Cepheus, 
three generations of associations have been iden- 



Fig. 17.1 Open clusters, (a) The Hyades slightly to the 
lower left in the photograph. Above them to the right the 
Pleiades. (Photograph M. Korpi.) (b) The Pleiades pho¬ 
tographed with the Metsahovi Schmidt camera. The di¬ 
ameter of the cluster is about 1°. Reflection nebulae are 
visible around some of the stars. (Photograph M. Pouta- 


nen and H. Virtanen, Helsinki University.) h and / Persei, 
the double cluster in Perseus. The separation between 
the clusters is about 25'. Picture taken with the Met¬ 
sahovi 60-cm Ritchey Chretien telescope. (Photograph T. 
Markkanen, Helsinki University) 
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Fig. 17.2 The globular 
cluster w Centauri. The 
picture was taken with the 
Danish 1.5-m telescope at 
La Silla, Chile. Thanks to 
the excellent seeing, one 
can see through the entire 
cluster in some places. 
(Photograph T. Korhonen, 
Turku University) 



Fig. 17.3 f Persei 
association. O and B stars 
are shown as open circles. 
The proper motion vectors 
show the movements of the 
stars in the next 500,000 
years 



tified, the oldest ones being most extended and 
the youngest ones, most dense. 


17.2 Open Star Clusters 

Open clusters usually contain from a few tens to 
a few hundreds of stars. The kinetic energy of 
the cluster members, the differential rotation of 


the Milky Way (Sect. 17.3) and external gravita¬ 
tional disturbances tend to gradually disperse the 
open clusters. Still, many of them are fairly per¬ 
manent; for example, the Pleiades is many hun¬ 
dreds of millions of years old, but nevertheless, 
quite a dense cluster. 

The distances of star clusters—and also of 
associations—can be obtained from the photo- 
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metric or spectroscopic distances of their bright¬ 
est members. For the nearest clusters, in partic¬ 
ular for the Hyades, one can use the method of 
kinematic parallaxes, which is based on the fact 
that the stars in a cluster all have the same average 
space velocity with respect to the Sun. The proper 
motions in the Hyades are shown in Fig. 17.4. 
They all appear to be directed to the same point. 
Figure 17.5 explains how this convergence can be 
understood as an effect of perspective, if all clus¬ 
ter members have the same velocity vector with 
respect to the observer. Let 9 be the angular dis¬ 
tance of a given star from the convergence point. 
The angle between the velocity of the star and 


the line of sight will then also be 9. The veloc¬ 
ity components along the line of sight and at right 
angles to it, v r and i; t , are therefore given by 


v T = v cosO, 
Vt — v sin 9. 


(17.1) 


The radial velocity v T can be measured from the 
Doppler shift of the stellar spectrum. The tangen¬ 
tial velocity i; t is related to the proper motion p 
and the distance r: 


v t — pr. (17.2) 


Thus the distance can be calculated: 



a 5h 4h 30 min 4h 


Fig. 17.4 Proper motions of the Hyades. The vectors 
show the movement of the stars in about 10,000 years, 
(van Bueren, H.G. (1952): Bull. Astr. Inst. Neth. 11) 



Fig. 17.5 If all stars move in the same direction, their tan¬ 
gential velocity components appear to be directed towards 
the convergence point K 


Vt v sin 9 v T 
r = — = -= — tan 9. 

p /x p 


(17.3) 


By means of this method, the distances of the 
individual stars can be determined from the mo¬ 
tion of the cluster as a whole. Since the method 
of (ground-based) trigonometric parallaxes is re¬ 
liable only out to a distance of 30 pc, the mov¬ 
ing cluster method is an indispensable way of de¬ 
termining stellar distances. The distance of the 
Hyades obtained in this way is about 40 pc. The 
distance obtained from the trigonometric paral¬ 
laxes measured directly by the Hipparcos satellite 
in the 1990’s is 46 pc. The Hyades is the nearest 
open cluster. 

The observed HR diagram or the correspond¬ 
ing colour-magnitude diagram of the Hyades 
and other nearby star clusters show a very well- 
defined and narrow main sequence (Fig. 17.6). 
Most of the cluster members are main sequence 
stars; there are only a few giants. There are quite 
a few stars slightly less than one magnitude above 
the main sequence. These are apparently binary 
stars whose components have not been resolved. 
To see this, let us consider a binary, where both 
components have the same magnitude m and the 
same colour index. If this system is unresolved, 
the colour index will still be the same, but the ob¬ 
served magnitude will be m — 0.75, i.e. slightly 
less than one magnitude brighter. 

The main sequences of open clusters are gen¬ 
erally located in the same section of the HR or 
colour-magnitude diagram (Fig. 17.7). This is be¬ 
cause the material from which the clusters formed 
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Fig. 17.6 Colour-magnitude diagram of the Hyades. Ap¬ 
parent visual magnitude on the left-hand vertical axis ; ab¬ 
solute visual magnitude on the right-hand one 



Fig. 17.7 Schematic colour-magnitude diagrams of star 
clusters. M3 is a globular cluster; the others are open clus¬ 
ters. Cluster ages are shown along the main sequence. The 
age of a cluster can be told from the point where its stars 
begin to turn off the main sequence. (Sandage, A. (1956): 
Publ. Astron. Soc. Pac. 68. 498) 


has not varied much, i.e. their initial chemical 
composition has been fairly constant. In younger 
clusters the main sequence extends to brighter 
and hotter stars and earlier spectral types. Usu¬ 
ally one can clearly see the point in the diagram 


where the main sequence ends and bends over to¬ 
wards the giant branch. This point will depend 
very strongly on the age of the cluster. It can 
therefore be used in determining the ages of open 
clusters. Star clusters are of central importance in 
the study of stellar evolution. 

The colour-magnitude diagrams of star clus¬ 
ters can also be used to determine their distances. 
The method is called main sequence fitting. By 
means of multicolour photometry the reddening 
due to interstellar dust can be removed from the 
observed colours B — V of the stars, yielding the 
intrinsic colours (B — V )q. Most star clusters are 
so far away from us that all cluster members can 
be taken to be at the same distance. The distance 
modulus 

mv 0 -M v = 51g —^— (17.4) 

10 pc 

will then be the same for all members. In (17.4), 
my 0 is the apparent, My the absolute visual mag¬ 
nitude of a star, and r the distance. It has been 
assumed that the extinction due to interstellar 
dust Ay has been determined from multicolour 
photometry and its effect removed from the ob¬ 
served visual magnitude m y\ 


my 0 — my — Ay. 


When the observed colour-magnitude dia¬ 
gram of the cluster is plotted using the apparent 
magnitude my 0 rather than the absolute magni¬ 
tude My on the vertical axis, the only change 
will be that the position of the main sequence 
is shifted vertically by an amount correspond¬ 
ing to the distance modulus. The observed (my 0 , 
(B — U)o) diagram may now be compared with 
the Hyades (My, ( B — V )o) diagram used as 
a standard. By demanding that the main se¬ 
quences of the two diagrams agree, the distance 
modulus and hence the distance can be deter¬ 
mined. The method is very accurate and efficient. 
It can be used to determine cluster distances out 
to many kiloparsecs. 


17.3 Globular Star Clusters 

Globular star clusters usually contain about 10 5 
stars. The distribution of the stars is spherically 
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symmetric, and the central densities are about ten 
times larger than in open clusters. Stars in glob¬ 
ular clusters are among the oldest in the Milky 
Way, and therefore they are of great importance 
for studies of stellar evolution. There are about 
150-200 globular clusters in the Milky Way. 

The colour-magnitude diagram of a typical 
globular cluster is shown in Fig. 17.8. The main 
sequence only contains faint red stars; there is 
a prominent giant branch, and the horizontal and 
asymptotic branches are clearly seen. The main 
sequence is lower than that of the open clusters, 
because the metal abundance is much lower in the 
globular clusters. 

The horizontal branch stars have a known ab¬ 
solute magnitude, which has been calibrated us¬ 
ing principally RRLyrae type variables. Because 
the horizontal branch stars are bright, they can 
be observed even in distant clusters, and thus us¬ 
ing them the distances of globular clusters can be 
well determined. 

Using the known distances, the linear sizes of 
globular clusters can be calculated. It is found 
that most of the mass is concentrated to a central 
core with a radius of about 0.3-10 pc. Outside 
this there is an extended envelope with a radius 
that may be 10-100 times larger. At even larger 
radii stars will escape from the cluster because of 
the tidal force of the Galaxy. 



Fig. 17.8 Colour-magnitude diagram of the globular 
cluster M5. In addition to the main sequence one can see 
the giant branch bending to the right and to its left the hor¬ 
izontal brunch. (Arp, H. (1962): Astrophys. J. 135, 311) 


The masses of globular clusters can be roughly 
estimated from the virial theorem, if the stel¬ 
lar velocities in the cluster have been measured. 
More precise values are calculated by fitting theo¬ 
retical models to the observed density and veloc¬ 
ity distributions. In this way masses in the range 
10 4 —10 6 Mq have been obtained. 

The globular clusters in the Milky Way fall 
into two classes. In the classification given in 
Table 18.1 these correspond to intermediate and 
halo population II. The disk globular clusters are 
concentrated towards the centre and the plane of 
the Milky Way and they form a system that is 
rotating with the general rotation of the Milky 
Way. In contrast, the halo clusters are almost 
spherically distributed in an extensive distribution 
reaching out to at least 35 kpc. The system of halo 
clusters does not rotate, but instead the velocities 
of individual clusters are uniformly distributed in 
all directions. The abundance of heavy elements 
is also different in the two classes of clusters. For 
disk clusters it is typically about 30 % of the so¬ 
lar value, for halo clusters it is only about 1 %. In 
some clusters the heavy element abundances are 
only 10 -3 times the solar value. The smallest val¬ 
ues, as low as 10 _4 -10 -5 have been detected in 
some field stars of the halo. They therefore give 
important information about the production of el¬ 
ements in the early Universe and during the for¬ 
mation of the Milky Way. 

All globular clusters are old, and the halo clus¬ 
ters are among the oldest known astronomical ob¬ 
jects. Determining a precise age is difficult, and 
requires both accurate observations of the turn¬ 
off point of the main sequence in the HR dia¬ 
gram, as well detailed theoretical stellar evolu¬ 
tion models. The ages obtained have been about 
13 x 10 9 years. This age is close to the age of 
the Universe calculated from its rate of expansion 
(see Chap. 20). 


17.4 Example 

Example 17.1 Assume that a globular cluster 
has a diameter of 40 pc and contains 100,000 stars 
of one solar mass each. 

(a) Use the virial theorem to find the average ve¬ 
locity of the stars. You can assume that the 
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average distance between stars equals the ra¬ 
dius of the cluster. 

(b) Find the escape velocity. 

(c) Comparing these velocities, can you tell 
something about the stability of the cluster? 


(a) First, we have to estimate the potential en¬ 
ergy. There are n(n — l)/2 n 1 2 /2 pairs of 
stars in the cluster, and the average distance 
of each pair is R. Thus the potential energy is 
about 


2 2 
m n 

U = -G -, 

R 2 


where m = 1 Mq. The kinetic energy is 


(b) The escape velocity from the edge of the 
cluster is 

l2Gmn 

= \! 4i> 2 = 2v = 6 km s~ 1 . 

(c) No. The average velocity seems to be smaller 
than the escape velocity, but it was derived 
from the virial theorem assuming that the 
cluster is stable. 
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where v is the root mean square velocity. Ac¬ 
cording to the virial theorem we have T — 
— 1/2 U, whence 

1 2 1 m 2 n 2 

—mvn — —G -. 

2 2 R 2 

Solving for the velocity we get 

v 2 

Gmn 
~ 2 R 

_ 6.7x 10 -11 m 3 * kg- 1 s“ 2 x2.0xl0 30 kgxl0 5 * 
” 40x3.lxlO 16 m 

= 1.1 x 10 7 m 2 s -2 , 


Exercise 17.1 A globular cluster consists of 
100,000 stars of the solar absolute magnitude. 
Calculate the total apparent magnitude of the 
cluster, if its distance is 10 kpc. 

Exercise 17.2 If the apparent magnitude of the 
cluster of the previous exercise is 10, what is its 
distance if the interstellar absorption in the direc¬ 
tion of the cluster is 1.5 mag/kpc? 

Exercise 17.3 The Pleiades open cluster con¬ 
tains 230 stars within 4 pc. Estimate the veloci¬ 
ties of the stars in the cluster using the virial the¬ 
orem. For simplicity, let the mass of each star be 
replaced by 1 Mq. 


which gives v 3 km s 1 




The Milky Way 


On clear, moonless nights a nebulous band of 
light can be seen stretching across the sky. This 
is the Milky Way (Fig. 18.1). The name is used 
both for the phenomenon in the sky and for the 
large stellar system causing it. The Milky Way 
system is also called the Galaxy—with a capital 
letter. The general term galaxy is used to refer to 
the countless stellar systems more or less like our 
Milky Way. 

The band of the Milky Way extends round the 
whole celestial sphere. It is a huge system con¬ 
sisting mostly of stars, among them the Sun. The 
stars of the Milky Way form a flattened disk-like 
system. In the direction of the plane of the disk 
huge numbers of stars are visible, whereas rela¬ 
tively few are seen in the perpendicular direction. 
The faint light of distant stars merges into a uni¬ 
form glow, and therefore the Milky Way appears 
as a nebulous band to the naked eye. A long- 
exposure photograph reveals hundreds of thou¬ 
sands of stars (Fig. 18.2). 

In the early 17th century Galileo Galilei, us¬ 
ing his first telescope, discovered that the Milky 
Way consists of innumerable stars. Already in 
the late 18th century William Herschel attempted 
to determine the size and shape of the Milky 
Way by means of star counts. Only early in the 
20th century did the Dutch astronomer Jacobus 
Kapteyn obtain the first estimate of the size of 
the Milky Way. The true size of the Milky Way 
and the Sun’s position in it became clear finally 
in the 1920’s from Harlow Shapley’s studies of 
the space distribution of globular clusters. 


In studying the structure of the Milky Way, it is 
convenient to choose a spherical coordinate sys¬ 
tem so that the fundamental plane is the symme¬ 
try plane of the Milky Way. This is defined as the 
symmetry plane of the distribution of neutral hy¬ 
drogen, and it agrees quite closely with the sym¬ 
metry plane defined by the distribution of stars in 
the solar neighbourhood (within a few kpc). 

The basic direction in the fundamental plane 
has been chosen to be the direction of the centre 
of the Milky Way. This is located in the constella¬ 
tion Sagittarius (a = 17 h45.7 min, S — — 29° 00', 
epoch 2000.0) at a distance of about 8.5 kpc. The 
galactic latitude is counted from the plane of the 
Galaxy to its pole, going from 0° to + 90°, and 
to the galactic south pole, from 0° to — 90°. The 
galactic coordinate system is shown in Fig. 18.3 
(see also Sect. 2.8). 


18.1 Methods of Distance 
Measurement 

In order to study the structure of the Milky Way, 
one needs to know how various kinds of objects, 
such as stars, star clusters and interstellar matter, 
are distributed in space. The most important ways 
of measuring the distances will first be consid¬ 
ered. 

Trigonometric Parallaxes The method of tri¬ 
gonometric parallaxes is based on the apparent 
yearly back-and-forth movement of stars in the 
sky, caused by the orbital motion of the Earth. 
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Fig. 18.1 The nebulous band of the Milky Way stretches across the entire sky. (Photograph M. and T. Keskiila, Lund 
Observatory) 


From Earth-based observations the trigonometric 
parallaxes can be reliably measured out to a dis¬ 
tance of about 30 pc; beyond 100 pc this method 
is no longer useful. The situation is, however, 
changing. The limit has already been pushed to 
a few hundred parsecs by the Hipparcos satellite, 
and Gaia will mean another major leap in the ac¬ 
curacy. 

The Motion of the Sun with Respect to the 
Neighbouring Stars. The Local Standard of 
Rest The motion of the Sun with respect to the 
neighbouring stars is reflected in their proper mo¬ 
tions and radial velocities (Fig. 18.4). The point 
towards which the Sun’s motion among the stars 
seems to be directed is called the apex. The op¬ 
posite point is the antapex. The stars near the 
apex appear to be approaching; their radial veloc¬ 
ities have the largest negative values, on the av¬ 
erage. In the direction of the antapex the largest 
(positive) radial velocities are observed. On the 
great circle perpendicular to the apex-antapex di¬ 
rection, the radial velocities are zero on the aver¬ 
age, but the proper motions are large. The average 
proper motions decrease towards the apex and the 
antapex, but always point from the apex towards 
the antapex. 


In order to study the true motions of the stars, 
one has to define a coordinate system with respect 
to which the motions are to be defined. The most 
practical frame of reference is defined so that the 
stars in the solar neighbourhood are at rest, on 
the average. More precisely, this local standard 
of rest (LSR) is defined as follows. 

Let us suppose the velocities of the stars be¬ 
ing considered are distributed at random. Their 
velocities with respect to the Sun, i.e. their radial 
velocities, proper motions and distances, are as¬ 
sumed to be known. The local standard of rest is 
then defined so that the mean value of the veloc¬ 
ity vectors is opposite to the velocity of the Sun 
with respect to the LSR. Clearly the mean veloc¬ 
ity of the relevant stars with respect to the LSR 
will then be zero. The motion of the Sun with re¬ 
spect to the LSR is found to be: 


Apex coordinates 

a = 18 h 00 min = 270° 

1 = 56° 

5 = +30° 

b = +23° 

Solar velocity 

vq = 19.7 kms -1 



The apex is located in the constellation of Her¬ 
cules. When the sample of stars used to deter¬ 
mine the LSR is restricted to a subset of all the 
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Fig. 18.2 A section of 
about 40° of the Milky 
Way between the 
constellations of Cygnus 
and Aquila. The brightest 
star at the upper right is 
Vega (a Lyrae). 
(Photograph Palomar 
Observatory) 



stars in the solar neighbourhood, e.g. to stars of 
a given spectral class, the sample will usually 
have slightly different kinematic properties, and 
the coordinates of the solar apex will change cor¬ 
respondingly. 

The velocity of an individual star with respect 
to the local standard of rest is called the pecu¬ 
liar motion of the star. The peculiar velocity of 
a star is obtained by adding the velocity of the 
Sun with respect to the LSR to the measured ve¬ 
locity. Naturally the velocities should be treated 
as vectors. 

The local standard of rest is at rest only with 
respect to a close neighbourhood of the Sun. The 
Sun and the nearby stars, and thus also the LSR, 


are moving round the centre of the Milky Way at 
a speed that is ten times greater than the typical 
peculiar velocities of stars in the solar neighbour¬ 
hood (Fig. 18.5). 

Statistical Parallaxes The velocity of the Sun 
with respect to neighbouring stars is about 
20 kins -1 . This means that in one year, the Sun 
moves about 4 au with respect to the stars. 

Let us consider a star S (Fig. 18.6), whose an¬ 
gular distance from the apex is # and which is at 
a distance r from the Sun. In a time interval t the 
star will move away from the apex at the angular 
velocity u/t = ha because of the solar motion. 
In the same time interval, the Sun will move the 
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distance s. The sine theorem for triangles yields 
. s sin 1 } s sin 1 ? 

r^r'=— -»-, (18.1) 

sin u u 

because the distance remains nearly unchanged 
and the angle u is very small. In addition to the 
component // ,\ due to solar motion, the observed 
proper motion has a component due to the pe¬ 
culiar velocity of the star. This can be removed 



Fig. 18.3 The directions to the galactic centre and the 
North galactic pole (NGP) in equatorial coordinates. The 
galactic longitude / is measured from the galactic centre 
along the galactic plane. The coordinates of the Galactic 
centre are precessed from the defining equinox 1950 and 
are not very accurate (see A.P. Lane (1979), PASP, 91, 
405) 


Antapex 



Apex 


Fig. 18.4 Because of the motion of the Sun towards the 
apex, the average radial velocity of the nearby stars ap¬ 
pears largest in the apex and antapex directions 


by taking an average of (18.1) for a sample of 
stars, since the peculiar velocities of the stars in 
the solar neighbourhood can be assumed to be 
randomly distributed. By observing the average 
proper motion of objects known to be at the same 
distance one thus obtains their actual distance. 
A similar statistical method can be applied to ra¬ 
dial velocities. 

Objects that are at the same distance can be 
found as follows. We know that the distance mod¬ 
ulus m — M and the distance r are related accord¬ 
ing to: 

m — M = 5 lg(r/10 pc) + A(r), (18.2) 

where A is the interstellar extinction. Thus ob¬ 
jects that have the same apparent and the same 
absolute magnitude will be at the same distance. 
It should be noted that we need not know the ab¬ 
solute magnitude as long as it is the same for all 
stars in the sample. Suitable classes of stars are 



Fig. 18.5 The local standard of rest, defined by the stars 
in the solar neighbourhood, moves with respect to the 
galactic centre. However, the average value of the stellar 
peculiar velocities with respect to the LSR is zero 



Fig. 18.6 When the Sun has moved the distance s to¬ 
wards the apex, the direction to the star S appears to have 
changed by the angle u 
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main sequence A4 stars, RRLyrae variables and 
classical cepheids with some given period. The 
stars in a cluster are also all at the same distance. 
This method has been used, for example, to de¬ 
termine the distance to the Hyades as explained 
in Sect. 17.2. 

Parallaxes based on the peculiar or apex mo¬ 
tion of the Sun are called statistical or secular par¬ 
allaxes. 

Main Sequence Fitting If the distance of a 
cluster is known, it is possible to plot its HR dia¬ 
gram with the absolute magnitude as the vertical 
coordinate. Another cluster, whose distance is to 
be determined, can then be plotted in the same 
diagram using the apparent magnitudes as the 
vertical coordinate. Now the vertical distance of 
the main sequences tells how much the apparent 
magnitudes differ from the absolute ones. Thus 
the distance modulus m — M can be measured. 
This method, known as the main sequence fitting, 
works for clusters whose stars are roughly at the 
same distance; if the distances vary too much, 
a clear main sequence cannot be distinguished. 

Photometric Parallaxes The determination of 
the distance directly from (18.2) is called the 
photometric method of distance determination 
and the corresponding parallax, the photomet¬ 
ric parallax. The most difficult task when using 
this method usually involves finding the absolute 
magnitude; there are many ways of doing this. 
For example, the two-dimensional MKK spec¬ 
tral classification allows one to determine the ab¬ 
solute magnitude from the spectrum. The abso¬ 
lute magnitudes of cepheids can be obtained from 
their periods. A specially useful method for star 
clusters is the procedure of main sequence fitting. 
A condition for the photometric method is that 
the absolute magnitude scale first be calibrated by 
some other method. 

Trigonometric parallaxes do not reach very 
far. For example, even with the Hipparcos satel¬ 
lite, only a few cepheid distances have been ac¬ 
curately measured by this method. The method 
of statistical parallaxes is indispensable for cal¬ 
ibrating the absolute magnitudes of bright ob¬ 
jects. When this has been done, the photometric 



Fig. 18.7 The observed luminosity function @(My) and 
the initial luminosity function 'l'(My) for main sequence 
stars in the solar neighbourhood. The functions give the 
number of stars per cubic parsec in the magnitude inter¬ 
val [My — 1/2, My + 1/2]; they are actually the prod¬ 
ucts DO and D0 , where D is the stellar density function 
(in the solar neighbourhood) 

method can be used to obtain distances of objects 
even further away. 

Other examples of indicators of brightness, lu¬ 
minosity criteria, are characteristic spectral lines 
or the periods of cepheids. Again, their use re¬ 
quires that they first be calibrated by means of 
some other method. It is a characteristic fea¬ 
ture of astronomical distance determinations that 
the measurement of large distances is based on 
knowledge of the distances to nearer objects. 


18.2 Stellar Statistics 

The Stellar Luminosity Function By system¬ 
atically observing all stars in the solar neighbour¬ 
hood, one can find the distribution of their abso¬ 
lute magnitudes. This is given by the luminosity 
function 0(M) (Fig. 18.7), which gives the rela¬ 
tive number of main sequence stars with absolute 
magnitudes in the range [M — 1/2, M + 1 /2], No 
stars appear to be forming at present in the re¬ 
gion of space where the luminosity function has 
been determined. The age of the Milky Way is 
10-15 Ga, which means that all stars less massive 
than 0.9 Mq, will still be on the main sequence. 
On the other hand, more massive stars, formed 
early in the history of the Milky Way, will have 
completed their evolution and disappeared. Low- 
mass stars have accumulated in the luminosity 
function for many generations of star formation, 
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whereas bright, high-mass stars are the result of 
recent star formation. 

By taking into account the different main se¬ 
quence lifetimes of stars of different masses and 
hence of different magnitudes, one can deter¬ 
mine the initial luminosity function 0 (M), which 
gives the brightness distribution at the time of star 
formation, the zero age main sequence luminos¬ 
ity function. The relation between the function 0 
and the observed luminosity function is 

0(M) = 0(M)T o /t E (M), (18.3) 

where Tp is the age of the Milky Way and 
t E (M) is the main sequence lifetime of stars of 
magnitude M. Here we assume that the birth rate 
of stars of magnitude M has remained constant 
during the lifetime of the Milky Way. The initial 
luminosity function is shown in Fig. 18.7. 



The Fundamental Equation of Stellar Statis¬ 
tics. The Stellar Density A crucial problem for 
studies of the structure of the Milky Way is to find 
out how the density of stars varies in space. The 
number of stars per unit volume at a distance r in 
the direction (/, b ) from the Sun is given by the 
stellar density D — D(r, l. b). 

The stellar density cannot be directly observed 
except in the immediate neighbourhood of the 
Sun. However, it can be calculated if one knows 
the luminosity function and the interstellar ex¬ 
tinction as a function of distance in a given direc¬ 
tion. In addition the number of stars per unit solid 
angle (e.g. per square arc second) can be deter¬ 
mined as a function of limiting apparent magni¬ 
tude by means of star counts (Fig. 18.8). 

Let us consider the stars within the solid an¬ 
gle u> in the direction (/, b) and in the distance 
range [r, r + dr] (Fig. 18.9). We let their lumi¬ 
nosity function 0(M) be the same as in the so¬ 
lar neighbourhood and their unknown stellar den¬ 
sity D. The absolute magnitude M of the stars of 
apparent magnitude m is, as usual, 

M = m — 5 lg(r/10 pc) — A(r). 

The number of stars in the apparent magnitude 
interval [m — 0.5, m + 0.5] in the volume element 


Fig. 18.8 The stellar density is determined by means of 
star counts. In practice, the counting is done on photo¬ 
graphic plates. (Cartoon S. Harris) 



Fig. 18.9 The size of the volume element at distance r in 
the direction (/, b) is cor 2 dr 


(IV — cor 2 dr at distance r is (Fig. 18.9) 


d N(m) = D(r, l, b) 


x 0 


m — 5 lg 


10 pc 


— A(r ) 


dV. 


(18.4) 

The stars of apparent magnitude m in the given 
area of the sky will in reality be at many differ¬ 
ent distances. In order to obtain their total num¬ 
ber N(m), one has to integrate dN(m) over all 












18.2 Stellar Statistics 


373 


Fig. 18.10 The stellar 
density near the Sun. 

(a) The stellar density of 
spectral classes A2-A5 in 
the galactic plane, 
according to 

S. W. McCuskey. The 
numbers next to the 
isodensity curves give the 
number of stars in 
10,000 pc 3 , (b) The 
distribution of different 
spectral classes 
perpendicularly to the 
galactic plane according to 

T. Elvius. The density in 
the galactic plane has been 
normalised to one 


a) 1=180° b) 



D 



distances r : 


-i 


N(m) = I D(r, l, b) 


X 0 


|111 — 


51g 


10 pc 


~A(r) 


cor 2 dr. 
(18.5) 


Equation (18.5) is called the fundamental equa¬ 
tion of stellar statistics. Its left-hand side, the 
number of stars in the apparent magnitude inter¬ 
val [m — 0.5, m + 0.5] in the solid angle co, is 
obtained from the observations: one counts the 
stars of different magnitudes in a chosen area of 
a photographic plate. The luminosity function is 
known from the solar neighbourhood. The extinc¬ 
tion A(r ) can be determined for the chosen areas, 
for instance, by means of multicolour photome¬ 
try. In order to solve the integral equation (18.5) 
for D(r. /, /?), several methods have been devel¬ 
oped, but we shall not go into them here. 

Figure 18.10(a) shows the stellar density in the 
solar neighbourhood in the plane of the Milky 
Way, and Fig. 18.10(b) in the direction perpen¬ 
dicular to the plane. There are several individual 
concentrations, but e.g. spiral structure cannot be 
observed in such a limited region of space. 


The Distribution of Bright Objects Using 
stellar statistical methods, one can only study 
the close neighbourhood of the Sun, out to about 
1 kpc at the most. Absolutely faint objects can¬ 


not be observed very far. Since the solar neigh¬ 
bourhood appears to be fairly representative of 
the general properties of the Milky Way, its study 
is naturally important, giving information e.g. on 
the distributions and luminosity functions of stars 
of various spectral types. However, in order to get 
an idea of the larger-scale structure of the Milky 
Way, one has to make use of objects that are as 
absolutely bright as possible, and which can be 
observed even at large distances. 

Examples of suitable objects are stars of early 
spectral types, HII regions, OB associations, 
open star clusters, cepheids, RRLyrae stars, su¬ 
pergiants and giants of late spectral types, and 
globular clusters. Some of these objects differ 
greatly in age, such as the young OB associations, 
on the one hand, and the old globular clusters, on 
the other. Any differences in their space distribu¬ 
tion tell us about changes in the general structure 
of the Milky Way. 

The young optical objects, the HII regions, 
OB associations and open clusters, are strongly 
concentrated in the plane of the Milky Way (Ta¬ 
ble 18.1). Figure 18.11 shows that they also ap¬ 
pear to be concentrated in three drawn-out bands, 
at least within the observed region. Since these 
types of objects in other galaxies are known to 
be part of a spiral structure, the observed bands 
in the Milky Way have been interpreted as por¬ 
tions of three spiral arms passing through the so¬ 
lar neighbourhood. Stars of later spectral types 
seem to be much more evenly distributed. Apart 
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Table 18.1 Populations of the Milky Way; z is the vertical distance from the galactic plane, and u r the velocity com¬ 
ponent perpendicular to the galactic plane 


Population 

Typical objects 

Average age 
[ 10 9 a] 

z [pc] 

v r [k/s] 

Metal 

abundance 

Halo population II 

Subdwarfs, globular clusters RR Lyr 
(P > 0.4 d) 

14-12 

2000 

75 

0.001 

Intermediate population II 

Long period variables 

12-10 

700 

25 

0.005 

Disc population 

Planetary nebulae, novae bright red 
giants 

12-2 

400 

18 

0.01-0.02 

Old population I 

A stars, Me dwarfs classical cepheids 

2-0.1 

160 

10 

0.02 

Young population I 

Gas, dust, supergiants, TTau stars 

0.1 

120 


0.03-0.04 


Fig. 18.11 The 

distribution of various 
objects in the galactic 
plane. Three condensations 
can be discerned: the 
Sagittarius arm (lowest), 
the local arm near the Sun 
and (outermost) the 
Perseus arm 


180° O O-BO associations 



from a few special directions, interstellar dust 
limits observations in the galactic plane to within 
3-4 kpc. 

Old objects, particularly the globular clusters, 
have an almost spherical distribution about the 
centre of the Milky Way (Fig. 18.12). The space 
density of old objects increases towards the galac¬ 
tic centre. They can be used to determine the dis¬ 
tance of the Sun from the galactic centre; the 
value of this distance is about 8.5 kpc. More 
recent measurements using other methods have 
given a distance of 8 kpc (26,000 light years). 

Stellar Populations Studies of the motions of 
the stars in the Milky Way have revealed that the 


orbits of stars moving in the galactic plane are al¬ 
most circular. These stars are also usually young, 
a few hundred million years at the most. They 
also contain a relatively large amount of heavy 
elements, about 2-4 %. The interstellar mate¬ 
rial similarly moves in the galactic plane in al¬ 
most circular orbits. On the basis of their motions 
and their chemical composition, the interstellar 
medium and the youngest stars are collectively 
referred to as population I. 

Outside the plane of the Milky Way, an almost 
spherically symmetric halo extends out to over 
50 kpc and even further out there is a corona. The 
stellar density is largest near the galactic centre 
and decreases outwards. The halo contains very 
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Fig. 18.12 The 

distribution of globular 
clusters. (From S.R. 
Majewski, Stellar 
populations and the Milky 
Way, in C. Martinez Roger, 
I. Perez Fournon, 

F. Sanchez (Eds.) Globular 
Clusters, Cambridge 
University Press, 1999) 
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little interstellar matter, and its stars are old, per¬ 
haps up to 13 x 10 9 years. These stars are also 
very metal-poor. Their orbits may be very ec¬ 
centric and show no preference for the galactic 
plane. On the basis of these criteria, one defines 
stars of population II. Typical population II ob¬ 
jects are the globular clusters, and the RR Lyrae 
and W Virginis stars. 

While the Sun and the closest stars move much 
like the stars of population I, the stars of popula¬ 
tion II have large velocities with respect to the lo¬ 
cal standard of rest, up to more than 300 km s _1 . 
In reality their velocities at the solar distance 
from the galactic centre are quite small and may 
sometimes be opposite to the direction of mo¬ 
tion of the LSR. The large relative velocities 
only reflect the motion of the LSR with a veloc¬ 
ity of about 220 km s’ 1 round the galactic cen¬ 
tre. 

Between these two extremes, there is a se¬ 
quence of intermediate populations. In addition 
to populations I and II, one generally also speaks 
of a disk population, including the Sun, for in¬ 
stance. The typical motions, chemical compo¬ 
sition and age of the various populations (Ta¬ 
ble 18.1) contain information about the evolu¬ 
tion of our Galaxy and about the formation of its 
stars. 


18.3 The Rotation of the Milky Way 

Differential Rotation. Oort’s Formulas The 

flatness of the Milky Way is already suggestive 
of a general rotation about an axis normal to the 
galactic plane. Observations of the motions both 
of stars and of interstellar gas have confirmed 
this rotation and shown it to be differential. This 
means that the angular velocity of rotation de¬ 
pends on the distance from the galactic centre 
(Fig. 18.13). Thus the Milky Way does not ro¬ 
tate like a rigid body. Near the Sun, the rotational 
velocity decreases with radius. 

The observable effects of the galactic rotation 
were derived by the Dutch astronomer Jan H. 
Oort. Let us suppose the stars are moving in cir¬ 
cular orbits about the galactic centre (Fig. 18.14). 
This approximation is acceptable for population I 
stars and gas. The star S, seen from the Sun O 
at galactic longitude / at distance r, has circular 
velocity V at a distance R from the centre. Sim¬ 
ilarly for the Sun the galactic radius and velocity 
are Rq and Vo- The relative radial velocity iy of 
the star with respect to the Sun is the difference 
between the projections of the circular velocities 
on the line of sight: 

ly = V cos a — Vo sin/, (18.6) 
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Fig. 18.13 The effect of differential rotation on the ra¬ 
dial velocities and proper motions of stars, (a) Near the 
Sun the orbital velocities of stars decrease outwards in the 
Galaxy, (b) The relative velocity with respect to the Sun 
is obtained by subtracting the solar velocity from the ve¬ 


locity vectors in (a), (c) The radial components of the ve¬ 
locities with respect to the Sun. This component vanishes 
for stars on the same orbit as the Sun. (d) The tangential 
components of the velocities 



Fig. 18.14 In order to derive Oort’s formulas, the veloc¬ 
ity vectors of the Sun and the star S are divided into com¬ 
ponents along the line 05 and normal to it 


theorem to the triangle CSQ one obtains 


sin (a + 90°) Rq 

sin / R 


or 

R 0 

cosa = —sin/. (18.7) 

R 

Denoting the angular velocity of the star by 
co — V/R and that of the Sun by &>o = Vo/Rq, 
one obtains the observable radial velocity in the 
form 


u r = Rq(co — coq) sin/. (18.8) 

The tangential component of the relative ve¬ 
locity of the Sun and the star is obtained as fol¬ 
lows. From Fig. 18.14, 

v t = V sina — VqcosI = Roosina — RqcoqcosI. 


The triangle O CP gives 


R sin a = Rq cos / — r, 

where a is the angle between the velocity vector 

and hence 

of the star and the line of sight. From Fig. 18.14 

the angle CSQ = a + 90°. By applying the sine v t = Ro(a> — co o)cos/ — cor. (18.9) 
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Oort noted that in the close neighbourhood of 
the Sun (r <<C Rq), the difference of the angular 
velocities will be very small. Therefore a good 
approximation for the exact equations (18.8) and 
(18.9) is obtained by keeping only the first term of 
the Taylor series of co — coq in the neighbourhood 
of R = R 0 : 

( d&>\ 

Using co = V/R and V ( Rq ) = Vq, one finds 


C0~ COQ 


R l 


, dV\ 

/? ° ( di?)* =So 



(R - R 0 ). 


For R ss Rq r, the difference R — Rq & 
—r cos/. One thus obtains an approximate form 




'Vb 

/dV\ 

_Ro 

\dRJ R=R 0 . 


r cos / sin / 


or 


v T Ar sin2/, 


(18.10) 


where A is a characteristic parameter of the solar 
neighbourhood of the Galaxy, the first Oort con¬ 
stant: 


A=- 


1 

'Vo 

(W\ 1 

2 

_Ro 

KdRJ r=r 0 _ 


(18.11) 


For the tangential relative velocity, one simi¬ 
larly obtains, since cor coor: 


V t : 


'Vo 

(—) 

_Ro 

\dR/ R=R 0 . 


r cos 2 / — coor. 


Because 2 cos 2 / = 1 + cos 21, this may be written 
i>t Rs Ar cos2/ + Br, (18.12) 


where A is the same as before and B, the second 
Oort constant, is 


1 

ruo 

fdy\ 

2 


\dR J R=R 0 . 


(18.13) 




Fig. 18.15 The velocity components due to differen¬ 
tial rotation according to Oort’s formulas as functions 
of galactic longitude, (a) Radial velocities for objects at 
a distance of 1 and 2 kpc. (Compare with Fig. 18.13.) 
Strictly, the longitude at which the radial velocity vanishes 
depends on the distance. Oort’s formulas are valid only in 
the close vicinity of the Sun. (b) Proper motions 


Equation (18.10) says that the observed radial 
velocities of stars at the same distance should 
be a double sine curve as a function of galactic 
longitude. This has been confirmed by observa¬ 
tions (Fig. 18.15(a)). If the distance to the stars 
involved is known, the amplitude of the curve de¬ 
termines the value of the Oort constant A. 

Independently of distance, the proper motions 
of the stars form a double sine wave as a function 
of galactic longitude, as seen in Fig. 18.15(b). 
The amplitude of the curve is A and its mean 
value, B. 

In 1927 on the basis of this kind of analy¬ 
sis, Oort established that the observed motions 
of the stars indicated a differential rotation of 
the Milky Way. Taking into account an extensive 
set of observational data, the International As¬ 
tronomical Union IAU has confirmed the present 
recommended values for the Oort constants: 


The proper motion (i = i\/r is then given by the A = 15 km s 1 kpc 1 , B = -10 km s 1 kpc 1 . 

expression 

The Oort constants obey some interesting re- 
/r ~ Acos2/ + B. (18.14) lations. By subtracting (18.13) from (18.11), one 
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obtains 

A-B = ^=cd o. (18.15) 

R 0 

Adding (18.13) and (18.11) gives 

A + = . (18.16) 

\dR / r = r 0 

Knowing the values of A and B, one can cal¬ 
culate the angular velocity u >q = 0.005 3" /year, 
which is the angular velocity of the local standard 
of rest around the galactic centre. 

The circular velocity of the Sun and the LSR 
can be measured in an independent way by us¬ 
ing extragalactic objects as a reference. In this 
way a value of about 220 kms -1 has been ob¬ 
tained for Vo- Using (18.15) one can now calcu¬ 
late the distance of the galactic centre Rq. The 
result is about 8.5 kpc, in good agreement with 
the distance to the centre of the globular clus¬ 
ter system. The direction to the galactic centre 
obtained from the distribution of radial veloci¬ 
ties and proper motions by means of (18.10) and 
(18.14) also agrees with other measurements. 

The orbital period of the Sun in the Galaxy ac¬ 
cording to these results is about 2.5 x 10 8 years. 
Since the Sun’s age is nearly 5 x 10 9 years, it 
has made about 20 revolutions around the galac¬ 
tic centre. At the end of the previous revolution, 
the Carboniferous period had ended on Earth and 
the first mammals would soon appear. 

The Distribution of Interstellar Matter Ra¬ 
dio emission from interstellar gas, in particular 
that of neutral hydrogen, is not strongly absorbed 
or scattered by interstellar dust. It can therefore 
be used to map the structure of the Milky Way on 
large scales. Radio signals can be detected even 
from the opposite edge of the Milky Way. 

The position of a radio source, for example an 
HI cloud, in the Galaxy cannot be directly deter¬ 
mined. However, an indirect method exists, based 
on the differential rotation of the Galaxy. 

Figure 18.16 is a schematic view of a situation 
in which gas clouds on the circles P\, P 2 , ... are 
observed in the direction / (—90° < l < 90°). The 
angular velocity increases inwards, and therefore 
the greatest angular velocity along the line of 



Fig. 18.16 Clouds P\, P 2 , ... seen in the same direction 
at various distances 

sight is obtained at the point Pk, where the line 
of sight is tangent to a circle. This means that the 
radial velocity of the clouds in a fixed direction 
grows with distance up to the maximum velocity 
at cloud Pk: 

Wmax = R<fc(<w-«o), (18.17) 

where Rk — Rosin/. The distance of cloud Pi 
from the Sun is r = Rocos/. When r increases 
further, v T decreases monotonically. Figure 18.17 
shows how the observed radial velocity in a given 
direction varies with distance r, if the gas moves 
in circular orbits and the angular velocity de¬ 
creases outwards. 

The neutral hydrogen 21 cm line has been par¬ 
ticularly important for mapping the Milky Way. 
Figure 18.18 gives a schematic view of how the 
hydrogen spectral line is made up of the radia¬ 
tion of many individual concentrations of neutral 
hydrogen, clouds or spiral arms. The line com¬ 
ponent produced by each cloud has a wavelength 
which depends on the radial velocity of the cloud 
and a strength depending on its mass and density. 
The total emission is the sum of these contribu¬ 
tions. 

By making observations at various galactic 
longitudes and assuming that the clouds form at 
least partly continuous spiral arms, the distribu- 
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Fig. 18.17 The radial velocity as a function of distance 
(shown schematically) 

tion of neutral hydrogen in the galactic plane 
can be mapped. Figure 16.17 shows a map of 
the Milky Way obtained from 21 cm line obser¬ 
vations of neutral hydrogen. It appears that the 
neutral hydrogen is concentrated in spiral arms. 
However, interpretation of the details is difficult 
because of the uncertainties of the map. In order 
to obtain the distances to the gas clouds, one has 
to know the rotation curve, the circular velocity 
as a function of the galactic radius. This is deter¬ 
mined from the same radial velocity observations 
and involves assumptions concerning the density 
and rotation of the gas. The interpretation of the 
spiral structure obtained from radio observations 
is also still uncertain. For example, it is difficult 
to fit the radio spiral structure to the one obtained 
near the Sun from optical observations of young 
stars and associations. 

The Rotation, Mass Distribution and Total 
Mass of the Milky Way In (18.17) the galac¬ 
tic longitude / gives the galactic radius of the 
clouds with maximum radial velocity. By mak¬ 
ing observations at different longitudes, one can 
therefore use (18.17) to determine the angular ve¬ 
locity of the gas for various distances from the 
galactic centre. (Circular motions must be as¬ 
sumed.) In this way, the rotation curve a> — co(R) 



Fig. 18.18 Clouds at different distances have different 
velocities and therefore give rise to emission lines with 
different Doppler shifts. The observed flux density profile 
(continuous curx’e) is the sum of the line profiles of all 
the individual line profiles (dashed curves). The numbers 
of the line profiles correspond to the clouds in the upper 
picture 

and the corresponding velocity curve V = V(R) 
(= o) R ) are obtained. 

Figure 18.19 shows the rotation curve of the 
Milky Way. Its central part rotates like a rigid 
body, i.e. the angular velocity is independent of 
the radius. Outside this region, the velocity first 
drops and then begins to rise gradually. A maxi¬ 
mum velocity is reached at about 8 kpc from the 
centre. Near the Sun, a little further from the cen¬ 
tre, the rotational velocity is about 220 kms -1 . 
According to earlier opinions, the velocity con¬ 
tinues to decrease outwards. This would mean 
that most of the mass is inside the solar radius. 
This mass could then be determined from Ke¬ 
pler’s third law. According to (6.34), 

M=R 0 V$/G. 

Using the values Rq — 8.5 kpc and Vo = 
220 km s~ 1 , one obtains 

M= 1.9 x 10 41 kg = 1.0 x 10 11 M q . 
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Fig. 18.19 Rotation curve 
of the Milky Way based on 
the motions of hydrogen 
clouds. Each point 
represents one cloud. The 
thick line represents the 
rotation curve determined 
by Maarten Schmidt in 
1965. If all mass were 
concentrated within the 
radius 20 kpc, the curve 
would continue according 
to Kepler’s third law 
{broken line). The rotation 
curve determined by Leo 
Blitz on the basis of more 
recent observations begins 
to rise again at 12 kpc 


V 



The escape velocity at radius R is 
flGM r- 

V e = j— = VV2. (18.18) 

This gives an escape velocity near the Sun V e = 
310 kms -1 . One therefore should not see many 
stars moving in the direction of galactic rotation, 
/ = 90°, with velocities larger than 90 kms -1 
with respect to the local standard of rest, since the 
velocity of such stars would exceed the escape ve¬ 
locity. This has been confirmed by observations. 

The preceding considerations have been based 
on the assumption that near the Sun, the whole 
mass of the Galaxy can be taken to be concen¬ 
trated in a central point. If this were true, the 
rotation curve should be of the Keplerian form, 
V oc R -1 / 2 . That this is not the case can be estab¬ 
lished from the values of the Oort constants. 

The derivative of the Keplerian relation 


v = J— = Vgmr - 1 '2 

V R 


yields 


dV 

dR 


--Vgm r~ 3/2 = - 
2 


1 V 
2~R 


Using the properties (18.15) and (18.16) of the 
Oort constants, one finds 


(18.19) 


for a Keplerian rotation curve. This disagrees 
with the observed value and thus the assumed Ke¬ 
plerian law does not apply. 

The mass distribution in the Milky Way can 
be studied on the basis of the rotation curve. One 
looks for a suitable mass distribution, such that 
the correct rotation curve is reproduced. Recently 
distant globular clusters have been discovered, 
showing that the Milky Way is larger than ex¬ 
pected. Also, observations of the rotation curve 
outside the solar circle suggest that the rotational 
velocity might begin to rise again. These results 
suggest that the mass of the Galaxy might be as 
much as ten times larger than had been thought. 


18.4 Structural Components of the 
Milky Way 

We have seen how the structure of the Milky Way 
can be globally described by means of an almost 
spherical halo of old stars and a disk of gas and 
young and middle-aged stars. Due to its structure 
the Milky Way belongs to disk galaxies, which 
are discussed in the next chapter. In a more de¬ 
tailed picture more small-scale features can also 
be distinguished. 

The Thick Disk In the traditional scheme di¬ 
viding the stars of the Milky Way into a se¬ 
ries of populations it was left undecided whether 


(A-B)/(A + B) = 2 
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the populations should be considered as qualita¬ 
tively different classes or merely steps along a 
continuous sequence. As the quality and quan¬ 
tity of observations have improved, it has become 
clear that what was defined as the intermediate 
population II really represents a separate compo¬ 
nent of the Milky Way with a pattern of element 
abundances stellar motions that separate it clearly 
from the old (thin) disk. This population is now 
referred to as the thick galactic disk. A thick disk 
has also been detected in some other galaxies, but 
it does not appear to be a universal feature of all 
disk galaxies. 

The Galactic Bar As will be seen in the next 
chapter. Sect. 19.1, a large fraction of all disk 
galaxies are barred, with an elongated light distri¬ 
bution at the centre. The first indication that this 
might also be the case for the Milky Way was 
found in velocity measurements of neutral hydro¬ 
gen, which were incompatible with gas moving 
along circular orbits. In 1971 W.W. Shane showed 
that the motions of the gas could be explained, 
if there is a central bar pointing about 20° away 
from the direction of the Galactic centre. 

It is more difficult to detect a bar in observa¬ 
tions of the stars. This was first done using the 
COBE satellite, which (apart from mapping the 
cosmic microwave background, see Sect. 19.7) 
also made a map of the sky at the infrared wave¬ 
lengths dominated by the light of old stars. Be¬ 
cause of perspective, the nearer end of the bar at 
positive galactic longitude will look slightly dif¬ 
ferent than the farther end. Such an asymmetry 
was present in the infrared map, consistent with 
a bar with an axial ratio of 0.6. 

Later confirmation of the existence of the bar 
has come from mapping of the central distribu¬ 
tion of old stars using near-infrared photometric 
distances, see Fig. 18.20. 

Spiral Structure As mentioned earlier, the 
Milky Way appears to be a spiral galaxy 
(Fig. 18.21), but there is no general agreement on 
the detailed form of the spiral pattern. For exam¬ 
ple, in 1976 Y.M. Georgelin and Y.P. Georgelin 
determined the distances of HII regions by ra¬ 
dio and optical observations. In the optical region 



Fig. 18.20 The Milky Way bar viewed from the North 
Galactic pole. Solid symbols indicate the mean positions 
of red clump giants in given directions, and the thick 
grey lines represent the range of their distances. The line 
through the mean distances illustrates a bar of 3 kpc length 
inclined 22.5° to the direction to the Galactic centre. 
The contour map shows a model of the bar derived from 
infrared observations. (C. Babusiaux, G. Gilmore 2005, 
MNRAS 358, 1309, Fig. 6) 
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Fig. 18.21 General view of the spiral pattern of the Milky 
Way. Different tracers of spiral arms lead to somewhat dif¬ 
ferent patterns, but they tend to agree that a four-armed 
pattern like the one indicated here gives the best over¬ 
all representation. The names of the arms are those most 
commonly used. See also Fig. 18.11. (Y. Xu et al. 2006, 
Science 311, 54) 
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Fig. 18.22 Left: The central parts of the Milky Way in a 
radio picture. The observations were made with the VLA 
telescope. (Image Kassim et al., Naval Research Labora¬ 
tory.) Right. The image by the Chandra satellite shows the 

their method is independent of assumptions about 
the galactic rotation law. They then fitted four spi¬ 
ral arms through the HII regions. 

Later investigations using a variety of meth¬ 
ods, both optical and radio have confirmed that 
a four-armed pattern gives the best descrip¬ 
tion of the spiral structure in the Sun’s vicinity 
(Fig. 18.22). The pitch angle of the spiral in this 
model is about 11.3°. Three of the arms start at 
the position of the galactic bar. 

The cause of the spiral structure is a long¬ 
standing problem. A small perturbation in the 
disk will quickly be stretched into a spiral shape 
by differential rotation. However, such a spiral 
would disappear in a few galactic revolutions, 
a few hundred million years. 

An important step forward in the study of the 
spiral structure was the density wave theory de¬ 
veloped by Chia-Chiao Lin and Frank H. Shu 
in the 1960’s. The spiral structure is taken to be 
a wavelike variation of the density of the disk. 
The spiral pattern rotates as a solid body with an 
angular velocity smaller than that of the galactic 
rotation, while the stars and gas in the disk pass 
through the wave. 


surroundings of the supermassive black hole SgraA* at the 
centre of the Milky Way. The width of the picture is 8.4', 
corresponding 20 parsecs at the distance of the centre of 
the Milky Way. (NASA/CXC/MIT/F.K. Baganoff et al.) 

The density wave theory explains in a natural 
way why young objects, like molecular clouds, 
HII regions and bright young stars are found in 
the spiral arms. As gas passes through the wave, 
it is strongly compressed. The internal gravity of 
the gas clouds then becomes more important and 
causes them to collapse and form stars. 

It takes about 10 7 years for the material to pass 
through a spiral arm. By that time, the hot, bright 
stars have finished their evolution, their ultravio¬ 
let radiation has ceased and the HII regions have 
disappeared. The less massive stars formed in the 
spiral arms are spread out in the disk by their pe¬ 
culiar velocities. 

It is not yet clear what gives rise to the spiral 
wave. For some further discussion of spiral struc¬ 
ture, see Sect. 19.4. 

The Galactic Centre Our knowledge of the 
centre of the Milky Way is mostly based on radio 
and infrared observations (Fig. 18.22). In the op¬ 
tical region the view towards the centre is blocked 
already by the dark clouds in the Sagittarius spiral 
arm about 2 kpc from us. The galactic centre is in¬ 
teresting because it may be a small-scale version 
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of the much more violently active nuclei of some 
external galaxies (see Sect. 19.7). It therefore pro¬ 
vides opportunities to study at close hand phe¬ 
nomena related to active galaxies. At the galac¬ 
tic centre there us supermassive black hole with a 
mass of nearly 5 x 10 6 Mq 

As one approaches the galactic centre the stel¬ 
lar density continues to rise towards a sharp cen¬ 
tral peak. In contrast, the galactic gas disk has 
a central hole of radius about 3 kpc. This may 
be due to the galactic bar, which will channel gas 
into the galactic nucleus leaving a gas-free zone 
at larger radii. 

Inside the central hole there is a dense gas 
disk. Its radius is about 1.5 kpc in neutral hy¬ 
drogen, but most of its mass is molecular and 
concentrated within 300 pc of the nucleus. In 
this region the mass of molecular gas is about 
10 8 Mq, or 5 % of the total molecular mass of 
the Milky Way. The molecular clouds are prob¬ 
ably confined by the pressure from surrounding 
very hot (T ~ 10 8 K) gas. This hot gas may then 
expand vertically, forming a galactic wind. Gas 
lost to a wind or to star formation is replenished 
with infalling gas from larger radii. 

The central 10 pc are dominated by the ra¬ 
dio continuum source SgrA and a dense star 
cluster observed in the infrared. There is also 
molecular gas with complex motions and signs 
of star formation activity. Within SgrA there 
is a unique point-like radio continuum source 
known as SgrA*. The position of SgrA* agrees 
to within 1" with the centre of a cluster of stars 
that is much denser than anything observed in the 
galactic disk. 

The stellar density of the central star cluster 
can be as high as 3 x 10 14 stars per cubic parsec, 
while around the Sun it is just one star per cubic 
parsec. The stars of the cluster orbit the central 
black hole. Motions of the innermost stars have 
been followed since the end of the 1990’s. A cou¬ 
ple of stars have already managed to make a full 
revolution around the centre (Fig. 18.23), the pe¬ 
riods being 16 and 11.5 years. When the period is 
known and the size of the orbit is seen the mass of 
the central body can be calculated. The researches 
working with the Keck telescope got in 2014 a 
value of 4.6 ± 0.7 x 10 6 solar masses. 



Fig. 18.23 Motions of stars near the centre of the Milky 
Way during over 15 years. The background image was 
taken in 2014 with the Keck telescope. (Kuva Andrea 
Ghez/UCLA/W.M. Keck Telescopes) 

According to VLBI measurements the diam¬ 
eter of the accretion disk surrounding the black 
hole is less than 10 au. Eruptions observed at 
infrared wavelengths, lasting only about twenty 
minutes, point to the same size. It can be calcu¬ 
lated that the diameter of the black hole is about 
0.2 au. 


18.5 The Formation and Evolution 
of the Milky Way 

Like all galaxies the Milky Way is supposed to 
have formed through the collapse of a region of 
higher than average density in the Universe. The 
general view of how this happened will be pre¬ 
sented in the next two chapters,in particular in 
Sects. 19.8 and 20.7. Some traces of the forma¬ 
tion process will be preserved in the properties of 
local stars of different ages. These represent in¬ 
formation about the formation of the Milky Way 
that is not available for other galaxies. 

The Ages of Stars The most direct way of 
studying the evolution of the Milky Way is by 
means of the observed ages of stars. The tra¬ 
ditional sequence of populations described in 
Sect. 18.2 is also a sequence corresponding to 











384 


18 The Milky Way 


Fig. 18.24 The 

age-metallicity relation of 
the Milky Way for different 
components. TDO: thin 
disc open clusters; TDG: 
thick disc globulars; 

B: bulge; YHG: young 
halo globulars; OHG: old 
halo globulars. The shaded 
areas correspond to thin 
disc field stars, thick disc 
field stars, and halo field 
stars, in order of increasing 
age. (K. Freeman, 

J. Bland-Hawthom 2002, 
ARAA 40, 487, Fig. 2) 



stars of different ages. The oldest component, the 
stellar halo forms an almost spherical distribu¬ 
tion of stars with ages 12-14 Ga, representing the 
oldest part of the Milky Way. 

In contrast, the old and young population I 
consists of stars with ages less than 10 Ga, which 
were initially formed in a very thin layer that has 
been thickened by encounters with spiral arms 
and molecular cloud complexes. 

Intermediate between these two, as we have 
noted, is the thick disk with an age about 10- 
12 Ga. Another intermediate population in the 
Milky Way is the central bulge including the 
galactic bar, and containing stars of ages 7-11 Ga. 

Chemical Enrichment The history of the for¬ 
mation of the Milky Way is preserved in the prop¬ 
erties of its older stars, above all in their “chemi¬ 
cal” composition, i.e. the abundances of elements 
and isotopes heavier than helium. (All these heav¬ 
ier elements are often referred to as “metals” in 
the astrophysical jargon.) 

Only hydrogen and helium were present as the 
first stars formed. As successive generations of 
stars evolved, nuclear reactions produced heavy 
elements, some of which were returned to the in¬ 
terstellar gas in supernova explosions or stellar 
winds. These heavy elements were then incorpo¬ 
rated into the following generation of stars, grad¬ 


ually increasing the metal abundance in the inter¬ 
stellar medium. 

Some of the stars formed were of low mass, 
and had life-times long enough that they are still 
present. Their chemical composition will reflect 
the abundances in the interstellar medium at the 
time they were formed. Studying the chemical 
abundances of stars of different ages therefore 
gives information on the star formation history of 
the Milky Way, both on the rate at which stars 
have formed at a given epoch and on the masses 
and other characteristics of those stars when they 
formed. 

One commonly used indicator of the metal 
abundance in stars is the mass of iron relative 
to hydrogen, in logarithmic units and measured 
relative to the solar values, [Fe/H], Figure 18.24 
shows the value of [FE/H] for different types of 
objects of various ages. The general picture is 
that the metal abundance rose rapidly during the 
first billion years and has afterwards grown only 
slowly. The lowest values of [Fe/H] discovered in 
old halo stars are about —5. 

Many models have been developed describ¬ 
ing the chemical evolution of the Milky Way 
and other galaxies, incorporating prescriptions 
for star formation, and infall of gas from the 
outside. In particular the rapid early rise of the 
metal abundance is difficult to explain in the sim- 
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plest models. This difficulty, the scarcity of old 
metal-poor disk stars, has become known as the 
“G dwarf problem”, since the oldest stars still 
on the main series are of spectral class G. The 
most direct way that the G dwarf problem can be 
avoided is to suppose that a large portion of the 
interstellar gas was accreted after the formation 
of the oldest stars. 

Formation of the Milky Way Galaxies are 
thought to have formed when gas clouds of higher 
than average density in the Universe collapsed 
under their own gravity. As the gas is compressed 
stars are born in it. After the end of the collapse 
the cloud settles into a quasi-stationary state. 
Evolution continues at a slower pace. Evolving 
stars return chemically enriched gas to interstel¬ 
lar space, where it is mixed with remaining, un¬ 
processed gas and star formation continues. 

There are two competing pictures of how 
galaxy formation proceeds. In the monolithic col¬ 
lapse model it is assumed that large galaxies form 
coherently in the collapse of a massive cloud con¬ 
taining the bulk of the material constituting the 
galaxy. In the hierarchical picture most of the 
stars form in much smaller clouds, which later ag¬ 
glomerate together to form the galaxies we now 
observe. The evidence for these pictures in other 
galaxies will be considered in the next chapter. 

The case of the Milky Way already shows that 
the contrast of the monolithic and the hierarchical 
picture is oversimplified. Some aspects of galac¬ 
tic structure fit more naturally into a monolithic 
theory. For example, the rapid collapse of the 
halo, followed by a more gradual build-up of the 
disk fits with this picture. The pattern of chemical 
abundances are also homogeneous in a way that is 
more compatible with a pattern of star formation 
that was uniform throughout galactic history. 

Other observations are indicative of a hier¬ 
archical formation history. For example, stellar 
abundance patterns in the thick disk differ from 
those in the old thin disk. The most natural way of 
explaining this is that the thick disk formed by the 
accretion of one or more small satellite galaxies 
with different histories of star formation. Another 
sign of the importance of the infall of satellite 
galaxies is the existence of systems such as the 


Sagittarius dwarf galaxy, which appears to be in 
the course of being disrupted by the Milky Way. 

There is about five times as much dark matter 
as visible matter. Recent simulations have con¬ 
firmed that this dark matter has an essential role 
in the formation of galaxies. During the first bil¬ 
lion years of the universe the dark matter formed 
gravitational condensates to which the visible 
matter was concentrated to form initial galax¬ 
ies. Formation of the current galaxies has been 
a complicated process that involved effects of the 
dark matter, interactions, collisions and mergers 
of galaxies. 


18.6 Examples 

Example 18.1 Show that if the stars are uni¬ 
formly distributed in space and there is no in¬ 
terstellar extinction, the number of stars brighter 
than apparent magnitude m is 

No(m) = Ao(0) x 10 o6m . 

Let us suppose first that all stars have the same 
absolute magnitude M. The distance in parsecs of 
those stars having the apparent magnitude m is 

r = 10 x 10°' 2(m “ M) . 

In order to appear brighter than m , a star has to 
be within a sphere of radius r. Because the stellar 
density is constant, the number of such stars is 
proportional to the volume: 

No(m) oc r 3 oc 10°' 6m . 

The result does not depend on the absolute 
magnitudes of the stars, so that the same result 
still applies when the magnitudes are not equal, 
as long as the luminosity function does not de¬ 
pend on distance. Thus the equation is generally 
valid under the stated conditions. 

Example 18.2 (The Estimation of Distances by 
Means of Oort’s Formulas) An object in the 
galactic plane at longitude / = 45° has the radial 
velocity of 30 kms~' with respect to the LSR. 
What is its distance? 
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According to (18.10), 

v T — Ar sin 2 1. 

Thus 

v r 30 kms -1 

r = — ; -=- T = 2 kpc. 

A sin 2/ 15 kms -1 kpc 

In practice, the peculiar velocities are so large that 
this method cannot be used for distance determi¬ 
nation. Oort’s formulas are mostly suitable only 
for statistical studies. 

Example 18.3 (Discussion of the Gravitational 
Field of a Uniform Disk) It can be shown that the 
gravitational field of a homogeneous infinite thin 
disk is constant and directed towards the plane of 
the disk. If the mass per unit area of the disk is a , 
the gravitational field is 

g = 2 ttGcj. 

A test particle located outside the plane will 
therefore get a constant acceleration towards 
the plane. Taking a numerical example, assume 
a mass of 10 11 Mq distributed uniformly on a cir¬ 
cular disk 20 kpc in diameter. The mass per unit 
area will be a 

10 u x 2 x 10 30 kg 
~~ 7t(10 4 x 3.086 x 10 16 m) 2 

= 0.67 kg m -2 . 

The corresponding gravitational field is 
g = 2.8 x 10 -10 ms -2 . 

Let a star be located at d — 1 kpc above the plane, 
initially at rest (not very near the edge of the disk, 
in order to keep our approximation valid). The 
disk will pull it towards the plane, and when the 
star crosses the plane, it has acquired a velocity 
given by 


The time required to reach the plane is 
t — v/g — 15 x 10 6 a. 


18.7 Exercises 

Exercise 18.1 Assume that the Sun and a star 
move around the Galaxy at the same speed in the 
same circular orbit in the galactic plane. Show 
that the proper motion of the star is independent 
of its distance. How big is this proper motion? 

Exercise 18.2 (a) A cepheid has a radial velocity 
of 80 kms -1 , and its galactic longitude is 145°. 
What is the distance of the cepheid? 

(b) The period of the cepheid is 3.16 d, and 
the apparent visual magnitude is 12.3. What is the 
distance derived from this information? Are the 
distances consistent? 

Exercise 18.3 (a) How many of the nearest stars 
(Table C. 15) are also among the brightest stars 
(Table C. 16)? Explain. 

(b) If the stellar density were constant, how 
many stars would there be within the distance of 
Canopus? 

Exercise 18.4 (a) Assume that the Galaxy is 
a homogeneous disk and the Sun lies in the cen¬ 
tral plane of the disk. The absolute magnitude of 
a star is M, galactic latitude b, and distance from 
the central plane z. What is the apparent magni¬ 
tude of the star, if the extinction inside the Galaxy 
is a mag kpc -1 ? 

(b) Assume that the thickness of the galactic 
disk is 200 pc. Find the apparent magnitude of 
a star with M — 0.0, b — 30°, distance r — 1 kpc, 
and a — 1 mag kpc -1 . 


v = ^2gd — 130 kms 1 . 
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The galaxies are the fundamental building blocks 
of the Universe. Some of them are very simple in 
structure, containing only normal stars and show¬ 
ing no particular individual features. There are 
also galaxies that are almost entirely made of neu¬ 
tral gas. On the other hand, others are complex 
systems, built up from many separate interact¬ 
ing components—stars, neutral and ionised gas, 
dust, molecular clouds, magnetic fields, cosmic 
rays.... The galaxies may form small groups or 
large clusters in space. At the centre of many 
galaxies, there is a compact nucleus that may 
sometimes be so bright that it overwhelms all the 
normal radiation of the galaxy. 

The luminosity of the brightest normal galax¬ 
ies may correspond to 10 12 solar luminosities, 
but most of them are much fainter—the smallest 
ones that have been discovered are about 10 5 Lq. 
Since galaxies do not have a sharp outer edge, 
to some extent their masses and radii depend on 
how these quantities are defined. If only the bright 
central parts are included, a giant galaxy may typ¬ 
ically have a mass of about 10 13 M 0 , and a radius 
of 30 kpc, and a dwarf, correspondingly, 10 7 Mq, 
and 0.5 kpc. 

When the existence of dark matter was con¬ 
firmed the whole field of galactic research was 
changed. It was only the dark matter that helped 
to give a consistent model for the origin and evo¬ 
lution of galaxies. 

The fraction of dark matter of the total mass of 
galaxies varies considerably. In big galaxies there 
is about five times as much dark matter as ordi¬ 
nary visible matter. In the smallest galaxies the 


ratio of the dark matter to the visible matter can 
be hundreds or even thousands. 

When smaller and smaller galaxies have been 
found the boundary between star clusters and 
galaxies has become obscure. The smallest galax¬ 
ies are similar to dense star clusters. Therefore 
some researches have redefined galaxies: a star 
system is a galaxy if in addition to stars it con¬ 
tains also dark matter. 


19.1 The Classification of Galaxies 

A useful first step towards an understanding of 
galaxies is a classification based on their various 
forms. Although such a morphological classifica¬ 
tion must always be to some extent subjective, it 
provides a framework within which the quanti¬ 
tative properties of galaxies can be discussed in 
a systematic fashion. However, it should always 
be remembered that the picture thus obtained will 
be limited to those galaxies that are large and 
bright enough to be easily visible in the sky. An 
idea of the consequent limitations can be obtained 
from Fig. 19.1, showing the radii and magnitudes 
of normal galaxies. One sees that only within 
a narrow region of this diagram can galaxies be 
easily found. If a galaxy has too large a radius for 
its magnitude (small surface brightness), it will 
disappear in the background light from the night 
sky. On the other hand, if its radius is too small, 
it looks like a star and is not noticed on a photo¬ 
graphic plate. In the following, we shall mainly 
be concerned with bright galaxies that fit within 
these limits. 
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If a classification is to be useful, it should 
at least roughly correspond to important phys- 



Fig. 19.1 Magnitudes and diameters of observable extra- 
galactic objects. Objects to the upper left look like stars. 
The quasars in this region have been discovered on the ba¬ 
sis of their spectra. Objects to the lower right have a sur¬ 
face brightness much smaller than that of the night sky. 
In recent years large numbers of low surface brightness 
galaxies have been discovered in this region. (Arp, H. 
(1965): Astrophys. J. 142, 402) 


ical properties of the galaxies. Most classifica¬ 
tions accord in their main features with the one 
put forward by Edwin Hubble in 1926. Hubble’s 
own version of the Hubble sequence is shown 
in Fig. 19.2. The various types of galaxies are 
ordered in a sequence from early to late types. 
There are three main types: elliptical , lenticu¬ 
lar, and spiral galaxies. The spirals are divided 
into two sequences, normal and barred spirals. 
In addition, Hubble included a class of irregular 
galaxies. In lenticular and spiral galaxies the mat¬ 
ter is concentrated in a flat disk and thus they can 
be called collectively as disk galaxies. 

The elliptical galaxies (Fig. 19.6) appear in the 
sky as elliptical concentrations of stars, in which 
the density falls off in a regular fashion as one 
goes outwards. Usually there are no signs of in¬ 
terstellar matter (dark bands of dust, bright young 
stars). The ellipticals differ from each other only 
in shape and on this basis they are classified as 
E0, El,..., E7. If the major and minor axes of an 
elliptical galaxy are a and b, its type is defined to 
be E n , where 

b\ 

1--J. (19.1) 

An E0 galaxy thus looks circular in the sky. The 
apparent shape of an E galaxy depends on the 
direction from which it is seen. In reality an 
E0 galaxy may therefore be truly spherical or 
it may be a circular disk viewed directly from 
above. 




Fig. 19.2 The Hubble sequence in Hubble’s 1936 ver- (S and Irr II); 18.5 (Irr I and dE). (Hubble, E.P. (1936): 
sion. At this stage the existence of type SO was still The Realm of the Nebulae (Yale University Press, New 
doubtful. Photographs of the Hubble types are shown in Haven)) 

Figs. 18.6 and 18.15 (E); 18.3 and 18.4 (SO and S); 18.12 
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Fig. 19.3 The classification of normal spiral and SO galaxies. (Mt. Wilson Observatory) 


A later addition to the Hubble sequence is 
a class of giant elliptical galaxies denoted cD. 
These are generally found in the middle of clus¬ 
ters of galaxies. They consist of a central part 
looking like a normal elliptical surrounded by an 
extended fainter halo of stars. 

In the Hubble sequence the lenticulars or 
SO galaxies are placed between the elliptical and 


the spiral types. Like the ellipticals they contain 
only little interstellar matter and show no signs 
of spiral structure. However, in addition to the 
usual elliptical stellar component, they also con¬ 
tain a flat disk made up of stars. In this respect 
they are like spiral galaxies (Figs. 19.3, 19.4). 

The characteristic feature of spiral galaxies is 
a more or less well-defined spiral pattern in the 
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Fig. 19.4 Different types of SBO and SB galaxies. The type (r) or (s) depends on whether the galaxy has a central ring 
or not. (Mt. Wilson Observatory) 


disk. Spiral galaxies consist of a central bulge, 
which is structurally similar to an E galaxy, and 
of a stellar disk, like in an SO galaxy. In addi¬ 
tion to these, there is a thin disk of gas and other 
interstellar matter, where young stars are being 
born, forming the spiral pattern. There are two se¬ 
quences of spirals, normal Sa-Sb-Sc, and barred 


SBa-SBb-SBc spirals. In the barred spirals the 
spiral pattern ends at a central bar, whereas in 
the normal spirals the spiral pattern may end at 
an inner ring or continue all the way to the cen¬ 
tre. The position of a galaxy within the spiral se¬ 
quence is determined on the basis of three criteria 
(which are not always in agreement): later types 
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Fig. 19.5 Above: The 
Small Magellanic Cloud 
(Hubble type Irr I), a dwarf 
companion of the Milky 
Way (Royal Observatory, 
Edinburgh). Below: The 
Sculptor Galaxy, a dE 
dwarf spheroidal. (ESO) 



have a smaller central bulge, more narrow spiral 
arms and a more open spiral pattern. The Milky 
Way Galaxy is thought to be of type SABbc (in¬ 
termediate between Sb and Sc, and between nor¬ 
mal and barred spirals). 

The classical Hubble sequence is essentially 
based on bright galaxies; faint galaxies have been 
less easy to fit into it (Fig. 19.5). For example, 
the irregular galaxies of the original Hubble se¬ 
quence can be divided into the classes Irr 1 and 


Irr II. The Irr I galaxies form a continuation of 
the Hubble sequence towards later types beyond 
the Sc galaxies. They are rich in gas and contain 
many young stars. Type Irr II are somewhat irreg¬ 
ular small ellipticals containing only few young 
stars but a lot of interstellar dust. 

The smallest irregular galaxies are classified 
as dlrr (dwarf irregulars). Another large group 
of dwarf galaxies are the spherical or elliptic 
dwarf galaxies dSph (dwarf spheroidals) and dE 
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Fig. 19.6 M32 (type E2), 
a small elliptical 
companion of the 
Andromeda Galaxy. 
(NOAO/Kitt Peak 
National Observatory) 



(dwarf ellipsoids). They are much smaller than 
ordinary elliptic galaxies or the stellar density is 
much lower. Recently new kinds of even smaller 
galaxies have been found. The include ultrafaint 
dwarfs (uFd), blue compact dwarfs (BCD) and 
ultracompact dwarfs (UCD) (Fig. 19.7). 

In addition to the precise morphological clas¬ 
sification a simpler classification based on colours 
was introduced at the beginning of the 2000’s. 
The idea to classify fast the millions of new 
galaxies found by modern digital survey pro¬ 
grams as well as the very faint galaxies whose 
shape is difficult to distinguish. When galaxies 
are plotted like stars in a colour-magnitude dia¬ 
gram they are seen to form two distinct regions: 
red galaxies (red sequence) and blue galaxies 
(blue sequence) (Fig. 19.8). The red galaxies are 
mainly ellipticals with old stars and blue galaxies 
mainly spirals with younger stars. 


19.2 Luminosities and Masses 



Fig. 19.7 The ultracompact dwarf M60-UCD1 is close to 
the galaxy M60. The mass of the dwarf galaxy is 200 x 10 6 
solar masses but the radius is only 100 lightyears. (Photo 
NASA, ESA, CXC, J. Strader (Michigan State Univer¬ 
sity)) 


Distances In order to determine the absolute lu¬ 
minosities and linear dimensions of galaxies one 
needs to know their distances. Distances are also 
needed in order to estimate the masses of galax¬ 
ies, because these estimates depend on the abso¬ 
lute linear size. Distances within the Local Group 


can be measured by the same methods as in¬ 
side the Milky Way, most importantly by means 
of variable stars. On the very large scale (be¬ 
yond 50 Mpc), the distances can be deduced on 
the basis of the expansion of the Universe (see 
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Fig. 19.8 A colour-magnitude graph contains 144,000 
galaxies from the Sloan digital sky survey (SDSS). Bright¬ 
ness increases from left to right. Galaxies are clearly di¬ 
vided into two regions, red and blue. (Blanton et al„ 2003, 
ApJ592, 819) 

Sect. 20.1). In order to connect these two re¬ 
gions one needs methods of distance determina¬ 
tion based on the properties of individual galax¬ 
ies. 

To some extent local distances can be deter¬ 
mined using structural components of galaxies, 
such as the sizes of HII regions or the mag¬ 
nitudes of globular clusters. However, to mea¬ 
sure distances of tens of megaparsecs, one needs 
a distance-independent method to determine the 
absolute luminosities of entire galaxies. Several 
such methods have been proposed. For example, 
a luminosity classification has been introduced 
for late spiral types by Sidney van den Bergh. This 
is based on a correlation between the luminosity 
of a galaxy and the prominence of its spiral pat¬ 
tern. Nowadays this method is not considered ac¬ 
curate enough. 

Other distance indicators are obtained if there 
is some intrinsic property of the galaxy, which is 
correlated with its total luminosity, and which can 
be measured independently of the distance. Such 
properties are the colour, the surface brightness 
and the internal velocities in galaxies. All of these 
have been used to measure distances to both spi¬ 
ral and elliptical galaxies. 


For example, the absolute luminosity of a gala¬ 
xy should depend on its mass. The mass, in turn, 
will be reflected in the velocities of stars and gas 
in the galaxy. Accordingly there is a relationship 
between the absolute luminosity and the velocity 
dispersion (in ellipticals) and the rotational ve¬ 
locity (in spirals). Since rotational velocities can 
be measured very accurately from the width of 
the hydrogen 21-cm line, the latter relationship 
(known as the Tully-Fisher relation) is perhaps 
the best distance indicator currently available. 

The luminosity of the brightest galaxies in 
clusters has been found to be reasonably constant. 
This fact can be used to measure even larger dis¬ 
tances, providing a method which is important in 
cosmology. 

Luminosities The definition of the total lumi¬ 
nosity of a galaxy is to some extent arbitrary, 
since galaxies do not have a sharp outer edge. The 
usual convention is to measure the luminosity of 
a galaxy out to a given value of the surface bright¬ 
ness, e.g. to 26.5 mag/sq.arcsec. For a given Hub¬ 
ble type, the total luminosity L may vary widely. 

As in the case of stars, the distribution of 
galaxy luminosities is described by the luminos¬ 
ity function 0(L) (Fig. 19.9). This is defined so 
that the space density of galaxies with luminosi¬ 
ties between L and L + dL is 0(L)dL. It can 
be determined from the observed magnitudes of 
galaxies, once their distances have been estimated 
in some way. In practice, one assumes some suit¬ 
able functional form for 0(L), which is then fit¬ 
ted to the observations. One common form is 
Schechter’s luminosity function, 

0(L)dL = 0*^\~ L ' L *d^y (19.2) 

The values of the parameters 0*, L *, a are obser- 
vationally determined for different types of ob¬ 
jects; in general, they will be functions of posi¬ 
tion. 

The shape of the luminosity function is de¬ 
scribed by the parameters a and L*. The relative 
number of faint galaxies is described by a. Since 
its observed value is about —1.1, the density of 
galaxies grows monotonically as one goes to¬ 
wards fainter luminosities. The luminosity func¬ 
tion falls off steeply above the luminosity L*, 
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Luminosity (L/L*) 

0.05 0.1 0.2 0.5 1 2 5 10 



Fig. 19.9 Compound luminosity function of thirteen 
clusters of galaxies. The open symbols have been ob¬ 
tained by omitting the cD galaxies. The distribution is then 
well described by (19.2). The cD galaxies (filled symbols ) 
cause a deviation at the bright end. (Schechter, P. (1976): 
Astrophys. J. 203, 297) 

which therefore represents a characteristic lu¬ 
minosity of bright galaxies. The observed L* 
corresponds to an absolute magnitude M* = 
—21.0 mag. The corresponding magnitude for the 
Milky Way Galaxy is probably —20.2 mag. The 
cD giant galaxies do not obey this brightness dis¬ 
tribution; their magnitudes may be —24 mag and 
even brighter. 

The parameter <£* is proportional to the space 
density of galaxies and is therefore a strong func¬ 
tion of position. Since the total number density 
of galaxies predicted by relation (19.2) is infi¬ 
nite, we define n* = density of galaxies with 
luminosity > L*. The observed average value 
of n* over a large volume of space is n* = 
3.5 x 10 -3 Mpc -3 . The mean separation between 
galaxies corresponding to this density is 4 Mpc. 
Since most galaxies are fainter than L *, and 
since, in addition, they often belong to groups, 
we see that the distances between normal galax¬ 
ies are generally not much larger than their diam¬ 
eters. 


Masses The distribution of mass in galaxies is 
a crucial quantity, both for cosmology and for 
theories of the origin and evolution of galax¬ 
ies. Observationally it is determined from the 
velocities of the stars and interstellar gas. To¬ 
tal masses of galaxies can also be derived from 
their motions in clusters of galaxies. The results 
are usually given in terms of the corresponding 
mass-luminosity ratio M/L, using the solar mass 
and luminosity as units. The value measured in 
the solar neighbourhood of the Milky Way is 
M/L = 3. If M/L were constant, the mass distri¬ 
bution could be determined from the observed lu¬ 
minosity distribution by multiplying with M/L. 
Unfortunately, there is no universally valid value 
for the ratio M/L. 

The masses of elliptical galaxies may be ob¬ 
tained from the stellar velocity dispersion given 
by the broadening of spectral lines. The method 
is based on the virial theorem (see Sect. 6.10), 
which says that in a system in equilibrium, the 
kinetic energy T and the potential energy U are 
related according to the equation 

2T + U = 0. (19.3) 

Since ellipticals rotate slowly, the kinetic energy 
of the stars may be written 

T = Mu 2 /2, (19.4) 

where M is the total mass of the galaxy and v the 
velocity width of the spectral lines. The potential 
energy is 

U = -GM 2 /2R, (19.5) 

where R is a suitable average radius of the galaxy 
that can be estimated or calculated from the light 
distribution. Introducing (19.4) and (19.5) into 
(19.3) we obtain: 

M = 2v 2 R/G. (19.6) 

From this formula the mass of an elliptical 
galaxy can be calculated when v 2 and R are 
known. Some observations of velocities in ellipti- 
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Fig. 19.10 Velocity of rotation V(R) fkms -1 ] and ve- the former is not. (Davies, R.L. (1981): Mon. Not. R. As- 
locity dispersion a(R) [kms -1 ] as functions of radius tron. Soc. 194, 879) 

[kpc] for types E2 and E5. The latter galaxy is rotating, 


Fig. 19.11 Rotation 
curves for seven spiral 
galaxies. (Rubin, V.C., 
Ford, W.K., Thonnard, N. 
(1978): Astrophys. J. 
(Lett.) 225, LI07) 



cal galaxies are given in Fig. 19.10. These will be 
further discussed in Sect. 19.4. The value of M/L 
derived from such observations is about 10 within 
a radius of 10 kpc. The mass of a bright elliptical 
might thus be up to 10 13 Mq. 

The masses of spiral galaxies are obtained 
from their rotation curve v(R), which gives the 
variation of their rotational velocity with radius. 
Assuming that most of the mass is in the almost 
spherical bulge, the mass within radius R, M{R), 
can be estimated from Kepler’s third law: 

M(R) = Rv(R) 2 /G. (19.7) 

Some typical rotation curves are shown in 
Fig. 19.11. In the outer parts of many spirals. 


v(R) does not depend on R. This means that 
M(R) is directly proportional to the radius—the 
further out one goes, the larger the interior mass 
is. Since the outer parts of spirals are very faint, 
at large radii the value of M/L is directly pro¬ 
portional to the radius. For the disk, one finds 
that M/L = 8 for early and M/L — 4 for late 
spiral types. The largest measured total mass is 
2 x 10 12 Mq. 

In order to measure the mass at even larger 
radii where no emission can be detected, motions 
in systems of galaxies have to be used. One pos¬ 
sibility is to use pairs of galaxies. In principle, the 
method is the same as for binary stars. However, 
because the orbital period of a binary galaxy is 
about 10 9 years, only statistical information can 





















396 


19 Galaxies 



Fig. 19.12 Stars and hot gas in the Hydra galaxy clus¬ 
ter. Left : Optical image of the centre of the cluster. Right: 
An X-ray image of the same area in the same scale hy 

be obtained in this way. The results are still uncer¬ 
tain, but seem to indicate values of M/L = 20-30 
at pair separations of about 50 kpc. 

A fourth method to determine galaxy masses 
is to apply the virial theorem to clusters of galax¬ 
ies, assuming that these are in equilibrium. The 
kinetic energy T in (19.4) can then be calcu¬ 
lated from the observed redshifts and the poten¬ 
tial energy U, from the separations between clus¬ 
ter galaxies. If it is assumed that the masses of 
galaxies are proportional to their luminosities, it 
is found that M/L is about 200 within 1 Mpc of 
the cluster centre. However, there is a large vari¬ 
ation from cluster to cluster. 

There are also two newer and more accu¬ 
rate methods for mass determination. The fifth 
method utilises the X-ray radiation coming from 
the hot gas around galaxies or galaxy clusters 
(Fig. 19.12). The temperature of the gas around 
galaxy clusters is as high as 10 7 , even E8 K. Since 
cloud seems to be more or less stable there must 
be a massive gravitational well preventing the gas 
from escaping. The dependence between the total 
mass of the galaxy cluster and the gas tempera¬ 
ture is almost linear: the hotter the gas the more 
mass there is. The total mass of the richest galaxy 


the Chandra satellite. The temperature of the gas is about 
4 x 10 7 K. (Optical image UK Schmidt/DSSl/ESO, SRC, 
X-ray image NASA/CXC/SAO) 

clusters is over 10 15 solar masses, and only about 
1 % of this is in the form of visible matter. 

Gravitational lenses (Sect. 19.7) are the most 
accurate method. A light ray passing by a galaxy 
is bent and the amount of bending depends on the 
mass of the galaxy. The method is generally used 
to determine masses of galaxy clusters but it can 
also be used for individual galaxies if the lensing 
galaxy forms several images of the background 
object or clearly distorts its shape. Gravitational 
lenses have finally confirmed that only about one 
sixth of the mass of galaxies is ordinary matter 
(like stars or gas) and the rest dark matter. 

Dark Matter the most precise value for the 
dark matter has been obtained by the WMAP and 
Planck satellites which have studied the details 
of the cosmic background radiation. As it will be 
mentioned in Chap. 20, about 31 % of the total 
content (mass + energy) of the universe is mat¬ 
ter. The final results of the Planck satellite pub¬ 
lished in 2015 show that the contribution of ordi¬ 
nary matter is 5 % and of cold dark matter 26 %. 

The distribution of the dark matter can nowa¬ 
days be determined by weak and strong gravita¬ 
tional lenses. The result is that the dark matter 
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Fig. 19.13 Distribution of dark matter around the galaxy 
cluster MACS J0717.5+3745. The background image was 
taken by the Hubble telescope. Superimposed is the distri¬ 


bution of dark matter calculated using strong and weak 
gravitational lenses. (NASA, ESA, Harald Ebeling (Uni¬ 
versity of Hawaii at Manoa) & Jean-Paul Kneib (LAM)) 


appears as wide halos around galaxies and galaxy 
clusters (Fig. 19.13). 

The nature of the dark matter is not yet known. 
It interacts with baryons almost only by its grav¬ 
ity. It is known to be “cold”, which means that 
the velocities of the particles are low unlike with 
neutrinos that always travel almost at the speed of 
light. 

It is assumed that the dark matter consists 
of unknown elementary particles. Possible candi¬ 
dates are light axions or heavy particles predicted 
by supersymmetry theories (WIMP, weakly in¬ 
teracting massive particle) like neutralinos. Their 
mass might be of the order 100 GeV-1 TeV, or 
hundreds of proton masses. Underground scintil¬ 
lation detectors and the LHC accelerator at the 
CERN are used to search for particles of the dark 
matter. 


19.3 Galactic Structures 

Ellipticals and Bulges In all galaxies the oldest 
stars have a more or less round distribution. In the 
Milky Way this component is represented by the 
population II stars. Its inner parts are called the 


bulge, and its outer parts are often referred to as 
the halo. There does not appear to be any phys¬ 
ically significant difference between the bulge 
and the halo. The population of old stars can be 
best studied in ellipticals, which only contain this 
component. The bulges of spiral and SO galaxies 
are very similar to ellipticals of the same size. 

The surface brightness distribution in elliptical 
galaxies essentially depends only on the distance 
from the centre and the orientation of the major 
and minor axis. If r is the radius along the major 
axis, the surface brightness I ( r ) is well described 
by cle Vaucouleurs’ law. 



-3.33 



(19.8) 


The constants in (19.8) have been chosen so 
that half of the total light of the galaxy is ra¬ 
diated from within the radius r e and the sur¬ 
face brightness at that radius is / e . The parame¬ 
ters r e and 7 e are determined by fitting (19.8) to 
observed brightness profiles. Typical values for 
elliptical, normal spiral and SO galaxies are in 
the ranges r e — 1-10 kpc and I e corresponds to 
20-23 magnitudes per square arc second. 
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Fig. 19.14 The distribution of surface brightness in E and 
cD galaxies. Ordinate : surface magnitude, mag/sq.arcsec; 
abscissa: (radius [kpc]) 1 / 4 . Equation (18.8) corresponds 
to a straight line in this representation. It fits well with 
an E galaxy, but for type cD the luminosity falls off more 


slowly in the outer regions. Comparison with Fig. 19.15 
shows that the brightness distribution in SO galaxies be¬ 
haves in a similar fashion. cD galaxies have often been 
erroneously classified as SO. (Thuan, T.X., Romanishin, 
W. (1981): Astrophys. J. 248, 439) 


Fig. 19.15 The 

distribution of surface 
brightness in types SO 
and Sb. Ordinate: 
mag/sq.arc sec; abscissa: 
radius [arc sec]. The 
observed surface brightness 
has been decomposed into 
a sum of bulge and disc 
contributions. Note the 
larger disc component in 
type Sb. (Boroson, T. 
(1981): Astrophys. J. 

Suppl. 46, 177) 



Although de Vaucouleurs’ law is a purely 
empirical relation, it still gives a remarkably 
good representation of the observed light distri¬ 
bution. However, in the outer regions of ellipti¬ 
cal galaxies, departures may often occur: the sur¬ 
face brightness of dwarf spheroidals often falls 
off more rapidly than (19.8), perhaps because 
the outer parts of these galaxies have been torn 
off in tidal encounters with other galaxies. In 
the giant galaxies of type cD, the surface bright¬ 
ness falls off more slowly (see Fig. 19.14). Such 
galaxies are usually at the centre of dense galaxy 


clusters, and during billions of years they have 
absorbed several smaller galaxies. The stars of 
the disrupted galaxies usually remain orbiting 
the giant galaxy in its outskirts increasing the 
brightness of the outer parts. cD galaxies have 
been called the cannibals of the world of galax¬ 
ies. 

Although the isophotes in elliptical galaxies 
are ellipses to a good approximation, their el- 
lipticities and the orientation of their major axes 
may vary as a function of radius. Different galax¬ 
ies differ widely in this respect, indicating that 
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the structure of ellipticals is not as simple as it 
might appear. In particular, the fact that the direc¬ 
tion of the major axis sometimes changes within 
a galaxy suggests that some ellipticals may not 
be axially symmetric in shape. A probable reason 
is that many galaxies of different shapes rotating 
in different directions have assimilated to the cur¬ 
rent galaxies. 

From the distribution of surface brightness, 
the three-dimensional structure of a galaxy may 
be inferred as explained in Box 19.1. The rela¬ 
tion (19.8) gives a brightness profile which is very 
strongly peaked towards the centre. The real dis¬ 
tribution of axial ratios for ellipticals can be sta¬ 
tistically inferred from the observed one. On the 
(questionable) assumption that they are rotation- 
ally symmetric, one obtains a broad distribution 
with a maximum corresponding to types E3-E4. 
If the true shape is not axisymmetric, it cannot 
even statistically be uniquely determined from 
the observations. 

Disks A bright, massive stellar disk is char¬ 
acteristic for SO and spiral galaxies, which are 
therefore called disk galaxies. There are indica¬ 
tions that in some ellipticals there is also a faint 
disk hidden behind the bright bulge. In the Milky 
Way the disk is formed by population I stars. 

The distribution of surface brightness in the 
disk is described by the expression 

I(r) = I 0 e~ r/ro . (19.9) 

Figure 19.11 shows how the observed radial 
brightness distribution can be decomposed into 
a sum of two components: a centrally domi¬ 
nant bulge and a disk contributing significantly at 
larger radii. The central surface brightness Iq typ¬ 
ically corresponds to 21-22 mag./sq.arcsec, and 
the radial scale length ro = 1-5 kpc. In Sc galax¬ 
ies the total brightness of the bulge is generally 
slightly smaller than that of the disk, whereas in 
earlier Hubble types the bulge has a larger total 
brightness. The thickness of the disk, measured 
in galaxies that are seen edge-on, may typically 
be about 1.2 kpc. Sometimes the disk has a sharp 
outer edge at about 4ro- 


The Interstellar Medium Elliptical and SO ga¬ 
laxies contain very little interstellar gas. How¬ 
ever, in some ellipticals neutral hydrogen amount¬ 
ing to about 0.1 % of the total mass has been de¬ 
tected, and in the same galaxies there are also 
often signs of recent star formation. In some 
SO galaxies much larger gas masses have been 
observed, but the relative amount of gas is very 
variable from one galaxy to another. The lack of 
gas in these galaxies is rather unexpected, since 
during their evolution the stars release much more 
gas than is observed. 

The relative amount of neutral hydrogen in 
spiral galaxies is correlated with their Hubble 
type. Thus Sa spirals contain about 2 %, Sc spi¬ 
rals 10 %, and Irr I galaxies up to 30 % or more. 

The distribution of neutral atomic hydrogen 
has been mapped in detail in nearby galaxies by 
means of radio observations. In the inner parts of 
galaxies the gas forms a thin disk with a fairly 
constant thickness of about 200 pc, sometimes 
with a central hole of a few kpc diameter. The gas 
disk may continue far outside the optical disk, be¬ 
coming thicker and often warped from the central 
disk plane. 

Most of the interstellar gas in spiral galaxies is 
in the form of molecular hydrogen. The hydrogen 
molecule cannot be observed directly, but the dis¬ 
tribution of carbon monoxide has been mapped 
by radio observations. The distribution of molec¬ 
ular hydrogen can then be derived by assuming 
that the ratio between the densities of CO and H 2 
is everywhere the same, although this may not 
always be true. It is found that the distribution 
obeys a similar exponential law as the young stars 
and HII regions, although in some galaxies (such 
as the Milky Way) there is a central density min¬ 
imum. The surface density of molecular gas may 
be five times larger than that of H I, but because 
of its strong central concentration its total mass is 
only perhaps two times larger. 

The distribution of cosmic rays and magnetic 
fields in galaxies can be mapped by means of 
radio observations of the synchrotron radiation 
from relativistic electrons. The strength of the 
magnetic field deduced in this way is typically 
0.5-1 11 T. The observed emission is polarised, 
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Fig. 19.16 A spiral galaxy NGC 5907 seen sideways. The galaxy is surrounded by star streams that are remnants of 
small dwarf galaxies merged to the big galaxy. (Photo Gabany, Martinez-Delgado et al., 2010, AJ 140, 962) 


showing that the magnetic field is fairly well- 
ordered on large scales. Since the plane of po¬ 
larisation is perpendicular to the magnetic field, 
the large-scale structure of the magnetic field can 
be mapped. However, the plane of polarisation is 
changed by Faraday rotation, and for this reason 
observations at several wavelengths are needed in 
order to determine the direction of the field. The 
results show that the field is generally strongest 
in the plane of the disk, and is directed along 
the spiral arms in the plane. The field is thought 
to have been produced by the combined action 
of rising elements of gas, perhaps produced by 
supernova explosions, and the differential rota¬ 
tion, in principle in the same way as the pro¬ 
duction of solar magnetic fields was explained in 
Chap. 13. 

The Outer Parts of Galaxies Even the outer¬ 
most parts outside the visible disk are not empty. 


Recently they have been seen to contain several 
different kinds of matter. The most abundant is 
the dark matter extending around the galaxy as a 
sphere or slightly flattened ellipsoid to a distance 
of tens of times further than the visible matter. 

Hot gas is the second largest component. Its 
high temperature and great abundance were sur¬ 
prising. For example, the Milky Way is sur¬ 
rounded by a region of hot gas extending as a 
spherical cloud as far as to 150 kiloparsecs. The 
temperature of the gas is 1-2.5 x 10 6 K and the 
total mass about the same order of magnitude as 
the total mass of the stars in the Milky Way. 

The third component comprises of separate 
gas clouds having greater velocities than other 
matter of the galaxy. Some of the clouds come 
from the centre of the Milky Way, some from 
other galaxies. 

Yet another component consists of star clusters 
and star streams. In addition to globular clusters 
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galaxies are accompanied by star concentrations 
that are remnants of dwarf galaxies assimilated 
to the galaxy. For example, the globular cluster 
M54 is not an original Milky Way cluster but a 
remnant of the Sagittarius dwarf ellipsoid moving 
through the central part of the Milky Way. Dis¬ 
rupting dwarf galaxies are sometimes seen as star 
streams around the galaxy (Fig. 19.16). 


Box 19.1 (Three-Dimensional Shape of Galax¬ 
ies) Equations (19.8) and (19.9) describe the 
distribution of galactic light projected on the 
plane of the sky. The actual three-dimensional 
luminosity distribution in a galaxy is obtained 
by inverting the projection. This is easiest for 
spherical galaxies. 



Let us suppose that a spherical galaxy has 
the projected luminosity distribution I(r) (e.g. 
as in (19.8)). With coordinates chosen ac¬ 
cording to the figure, I (r) is given in terms 
of the three-dimensional luminosity distribu¬ 
tion p(R) by 


I(r) = 



p(R)dz. 


Since z 2 = R 2 — r 2 , a change of the variable of 
integration yields 


Ur) = 2 



p(R)RdR 
V R 2 — r 2 


This is known as an Abel integral equation 
for p(R), and has the solution 


1 d r°° I (r )r dr 

nR d R J R Vr 2 - R 2 


1 r°° (dl/dr) dr 

n Jr sir 1 - R 2 ’ 


Introducing the observed I (r) into this expres¬ 
sion, one obtains the actual luminosity dis¬ 
tribution p(R). In the figure the solid curve 
shows the three-dimensional luminosity distri¬ 
bution obtained from the Vaucouleurs’ law (the 
dashed line). 



If the galaxy is not spherical, its three- 
dimensional shape can only be determined if 
its inclination with respect to the line of sight 
is known. Since galactic disks are thin and of 
constant thickness, the inclination i of a disk 
galaxy is obtained directly from the axis ratio 
of its projected image: sin i = b/a. 

When the inclination is known, the real axis 
ratio of the bulge qo can be determined from the 
projected value q. For a rotationally symmetric 
bulge the relation between q and qo is 


v. cj i — _ . 

l-?o 2 

The flattenings of disk galaxy bulges obtained 
from this relation lie in the range qo — 0.3-0.5. 
Since the inclinations of ellipticals are gener¬ 
ally unknown, only the statistical distribution 
of q can be determined from that of qo- 


19.4 Dynamics of Galaxies 

We have seen how the masses of galaxies can be 
derived from observed velocities of stars and gas. 
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The same observations can be used to study the 
internal distribution of mass in more detail. 

Slowly Rotating Systems The dynamics of el¬ 
liptical galaxies and disk galaxy bulges are stud¬ 
ied by means of the Doppler shifts and broad- 
enings of stellar absorption lines. Since a given 
absorption line is the sum of contributions from 
many individual stars, its Doppler shift gives their 
mean velocity, while its broadening is increased 
by an amount depending on the dispersion of stel¬ 
lar velocities around the mean. By observing how 
the wavelengths and widths of spectral lines be¬ 
have as functions of the radius, one can get some 
insight into the distribution of mass in the galaxy. 

Examples of the observed radial dependence 
of the rotational velocity and velocity disper¬ 
sion derived for some ellipticals were given in 
Fig. 19.10. The observed rotational velocities are 
often small (<100 kms -1 ), while the velocity 
dispersion may typically be about 200 kms -1 . If 
elliptical galaxies were in fact ellipsoids of revo¬ 
lution, there should be a statistical relation (when 
projection effects have been taken into account) 
between flatness, rotational velocity and velocity 
dispersion. Such a relationship has been observed 
for fainter ellipticals and for disk galaxy bulges. 
However, some of the brightest ellipticals rotate 
very slowly. Therefore their flattening cannot be 
due to rotation. 

The radial dependence of the velocity disper¬ 
sion gives information on the distribution of mass 
within the galaxy. Since it also depends on how 
the shapes of stellar orbits in the galaxy are dis¬ 
tributed, its interpretation requires detailed dy¬ 
namical models. 

Some ellipticals have different velocity sys¬ 
tems. An extreme case is NGC 4550 classified as 
E7/S0. Half of its stars orbit the centre in one di¬ 
rection and the rest in the opposite direction. This 
indicates that the galaxy is a merger of two spiral 
galaxies that originally rotated in opposite direc¬ 
tions. Mutual distances of stars in ellipticals are 
so large that collisions do not occur and the cur¬ 
rent system is very stable. 

Rotation Curves In spiral galaxies the distri¬ 
bution of mass can be studied directly using the 


observed rotational velocities of the interstellar 
gas. This can be observed either at optical wave¬ 
lengths from the emission lines of ionised gas 
in HII regions or at radio wavelengths from the 
hydrogen 21 cm line. Typical galactic rotation 
curves were shown in Fig. 19.11. 

The qualitative behaviour of the rotation curve 
in all spiral galaxies is similar to the rotation 
curve of the Milky Way: there is a central por¬ 
tion, where the rotational velocity is directly pro¬ 
portional to the radius, corresponding to rigid 
body rotation. At a few kpc radius the curve turns 
over and becomes flat, i.e. the rotational veloc¬ 
ity does not depend on the radius. In early Hub¬ 
ble types, the rotation curve rises more steeply 
near the centre and reaches larger velocities in 
the flat region (Sa about 300 kms -1 , Sc about 
200 kms -1 ). A higher rotational velocity indi¬ 
cates a larger mass according to (19.7), and thus 
Sa types must have a larger mass density near 
the centre. This is not unexpected, since a more 
massive bulge is one of the defining properties of 
early type spirals. 

A decrease of the rotational velocity at large 
radii would be an indication that most of the mass 
is inside that radius. In some galaxies such a de¬ 
crease has been detected, in others the rotational 
velocity remains constant as far out as the obser¬ 
vations can reach. 

Spiral Structure Spiral galaxies (Fig. 19.17) 
are relatively bright objects. Some have a well- 
defined, large-scale two-armed spiral pattern, 
whereas in others the spiral structure is made up 
of a large number of short filamentary arms. From 
galaxies where the pattern is seen in front of the 
central bulge, it has been deduced that the sense 
of winding of the spiral is trailing with respect 
to the rotation of the galaxy. However, the rule is 
not absolute: there are also galaxies with leading 
arms. 

The spiral structure is most clearly seen in the 
interstellar dust, HII regions, and the OB asso¬ 
ciations formed by young stars. The dust often 
forms thin lanes along the inner edge of the spiral 
arms, with star forming regions on their outside. 
Enhanced synchrotron radio emission associated 
with spiral arms has also been detected. 
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Fig. 19.17 Above: 

A spiral galaxy from 
above: M51 (type Sc). The 
interacting companion is 
NGC5195 (type Irr II). 
(Lick Observatory). Below: 
A spiral galaxy from the 
side: the Sb spiral 
NGC4565. (NOAO/Kitt 
Peak National 
Observatory) 



A well formed wide spiral pattern is relatively 
common, and hence it must be a long lasting phe¬ 
nomenon. The spiral pattern is generally thought 
to be a wave in the density of the stellar disk, 
as discussed in Sect. 18.4. As the interstellar 
gas streams through the density wave a shock, 
marked by the dust lanes, is formed as the in¬ 
terstellar gas is compressed, leading to the col¬ 


lapse of molecular clouds and the formation of 
stars. The density wave theory predicts charac¬ 
teristic streaming motions within the arm, which 
have been detected in some galaxies by observa¬ 
tions of the HI 21 cm line. 

There may be several reasons producing spiral 
arms. The most common is an external perturba¬ 
tion when another galaxy passes by at a close dis- 





404 


19 Galaxies 


tance creating a wave in the galaxy. Other sources 
of perturbation could be the bar of the galaxy or 
the gravitational field of a temporary "traffic jam” 
of stars. 

A single close encounter or a traffic jam can 
explain temporary spiral arms but not more per¬ 
manent features. The differential rotation of the 
galaxy tends to wipe out the arms during a few 
revolutions or in a few hundreds of millions of 
years. Some renewal mechanism is needed to 
keep the arms permanent. 

Standing waves can explain the permanence. 
Birth of spiral arms and their motion gener¬ 
ate density waves travelling through the whole 
galaxy. The waves can be reflected or refracted 
by the central bulge. When wave fronts moving 
in different directions meet they can either damp 
or amplify each others. Spiral arms are standing 
waves, fronts of amplified waves, seen as concen¬ 
trates of stars and gas clouds. The spiral forms 
move around the galaxy at constant speed. 


19.5 Stellar Ages and Element 
Abundances in Galaxies 

From the Milky Way we know that stars of pop¬ 
ulations I and II are different not only in respect 
to their spatial distribution, but also in respect to 
their ages and heavy element abundances. This 
fact gives important evidence about the formation 
of the Milky Way, and it is therefore of interest if 
a similar connection can be found in other galax¬ 
ies. 

The indicators of composition most easily 
measured are the variations of colour indices in¬ 
side galaxies and between different galaxies. Two 
regularities have been discovered in these varia¬ 
tions: First, according to the colour-luminosity 
relation for elliptical and SO galaxies, brighter 
galaxies are redder. Secondly, there is a colour- 
aperture effect, so that the central parts of galax¬ 
ies are redder. For spirals this relationship is due 
to the presence of young, massive stars in the 
disk, but it has also been observed for elliptical 
and SO galaxies. 

Galactic spectra are composed of the spectra 
of all their stars added together. Thus the colours 
depend both on the ages of the stars (young stars 


are bluer) and on the heavy element abundance Z 
(stars with larger Z are redder). The interpreta¬ 
tion of the observational results thus has to be 
based on detailed modelling of the stellar com¬ 
position of galaxies or population synthesis. 

Stars of different spectral classes contribute 
different characteristic absorption features to the 
galaxy spectrum. By observing the strength of 
various spectral features, one can find out about 
the masses, ages and chemical composition of 
the stars that make up the galaxy. For this pur¬ 
pose, a large number of characteristic properties 
of the spectrum, strengths of absorption lines and 
broad-band colours are measured. One then at¬ 
tempts to reproduce these data, using a represen¬ 
tative collection of stellar spectra. If no satisfac¬ 
tory solution can be found, more stars have to 
be added to the model. The final result is a pop¬ 
ulation model, giving the stellar composition of 
the galaxy. Combining this with theoretical stellar 
evolution calculations, the evolution of the light 
of the galaxy can also be computed. 

Population synthesis of E galaxies show that 
practically all their stars were formed simulta¬ 
neously about 13-14 x 10 9 years ago, which is 
close to the age of the universe Most of their light 
comes from red giants, whereas most of their 
mass resides in lower main sequence stars of less 
than one solar mass. 

Since all stars have roughly the same age, the 
colours of elliptical galaxies are directly related 
to their metallicities. Thus the colour-luminosity 
relation indicates that Z in giant ellipticals may 
be double that in the solar neighbourhood, while 
it may be smaller by a factor 100 in dwarfs. Simi¬ 
larly, the radial dependence of the colours can be 
explained if the value of Z at the centre is an or¬ 
der of magnitude larger than it is at larger radii. 

The stellar composition of disk galaxy bulges 
is generally similar to that of ellipticals. The ele¬ 
ment abundances in the gas in spirals can be stud¬ 
ied by means of the emission lines from HII re¬ 
gions ionised by newly formed stars. In this case 
too, the metallicity increases towards the centre. 

The birth histories of galaxies explain partly 
the different metallicities and age distributions. 
As we will see later, galaxies collide and merge 
in the span of billions of years. Most clearly this 




19.6 Systems of Galaxies 


405 


is seen in dwarf galaxies having sow few stars 
that even a single merger leaves a distinct signa¬ 
ture. Figure 19.18 shows the age distribution of 
two Milky Way’s neighbour galaxies. A few of 
the stars of Leo A were born immediately after 
the origin of the universe but most of the stars 
were born during the last 5-6 billion years. The 
stars of the Carina dwarf galaxy were born dur¬ 
ing three separate periods, at the age of 1-2, 6-8 
and about 11 billion years. 


19.6 Systems of Galaxies 

The galaxies are not smoothly distributed in 
space; rather, they form systems of all sizes: 
galaxy pairs, small groups, large clusters and su¬ 
perclusters formed from several groups and clus¬ 
ters. The larger a given system, the less its density 
exceeds the mean density of the Universe. On the 
average, the density is twice the background den¬ 
sity for systems of radius 5 Mpc and 10 % above 
the background at radius 20 Mpc. 

Interactions of Galaxies Galaxies do not 
evolve in isolation; instead, they interact continu¬ 
ously with other galaxies. Each large galaxy has 
experienced encounters or collisions with several 
other large galaxies and tens or even hundreds 
encounters with much smaller galaxies. 

Among the most common distortions are 
“bridges” and “tails” caused by tidal forces dur¬ 
ing an encounter (Fig. 19.19). Sometimes a small 
galaxy breaks down and forms a ring in the equa¬ 
torial plane of the larger galaxy or sometimes on 
a polar orbit. The most striking ring galaxies are 
born when a dense galaxy passes straight through 
another one (Fig. 19.20). In many cases the ef¬ 
fects of collisions are seen as starbursts when the 
gas and dust clouds of the galaxies meet, con¬ 
dense and start to produce new stars. Sometimes 
image processing is needed to show faint but 
sharp rings around a galaxy (Fig. 19.21). 

The interactions between galaxies are not al¬ 
ways dramatic. For example, the Milky Way 
has two satellites, the Large and Small Mag¬ 
ellanic Clouds (see Fig. 19.5), which are Irr I 
type dwarf galaxies at about 60 kpc distance. 
It is thought that approximately 5 x 10 8 years 



Fig. 19.18 The age distribution of the dwarf galaxies in 
Leo and Carina shows that the stars of the galaxies were 
formed in separate collision episodes. (Tolstoy, Hill, Tosi, 
2009, Ann. Rev. Astron. Astrophys. 47, 371) 

ago, these passed the Milky Way at a distance of 
about 10-15 kpc, leaving behind the Magellanic 
Stream, a 180° long thin stream of neutral hy¬ 
drogen clouds. Systems of this type, where a gi¬ 
ant galaxy is surrounded by a few small compan¬ 
ions, are quite common. Computations show that 
in many such cases the tidal interactions are so 
strong that the companions will merge with the 
parent galaxy at the next close approach. This is 
likely to happen to the Magellanic Clouds. 

During earlier epochs in the Universe, when 
the density was larger, interactions between galax¬ 
ies must have been much more common than at 
present. Thus it has been proposed that a large 
fraction of bright galaxies have undergone ma¬ 
jor mergers at some stage in their history. In 
particular, there are good reasons to believe that 
the slowly rotating, non-axisymmetric giant el¬ 
lipticals may have formed by the merger of disk 
galaxies. 
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Fig. 19.19 The galaxy 
pair NGC 4676 is known as 
the Mice. The galaxies are 
about to collide and a long 
gas and star tail protrudes 
from each of them. 
(Hubble/NASA, H. Ford 
(JHU), G. Illingworth 
(UCSC/LO), M. Clampin 
(STScI), G. Hartig 
(STScI), the ACS Science 
Team, ja ESA) 



Fig. 19.20 The Cartwheel 
is the best known ring 
galaxy. It was born when a 
small galaxy passed 
through the centre of the 
big galaxy. (Photo Hubble, 
K. Borne (StScI) and Nasa) 



Groups The most common type of galaxy sys¬ 
tems are small, irregular groups of a few tens of 
galaxies. A typical example is the Local Group, 
which contains two larger galaxies in addition to 
the Milky Way—the Andromeda Galaxy M31, 
an Sb spiral of about the same size as the Milky 
Way with two dwarf companions, and the smaller 
Sc spiral M33. The rest of the about 50 members 


of the Local Group are dwarfs; about 20 are of 
type dE and 10 of type Irr I. The diameter of the 
Local Group is about 1.2 Mpc. 

Clusters A system of galaxies may be defined 
to be a cluster if it contains a larger number (at 
least 50) of bright galaxies. The number of mem¬ 
bers and the size of a cluster depend on how they 
are defined. One way of doing this is to fit the 
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Fig. 19.21 Rings of stars 
with sharp boundaries in 
the outer parts of the 
galaxy NGC 474. The rings 
were born when a 
lenticular galaxy tore a 
rotating spiral galaxy layer 
by layer. 

(CFHT/Coelum—J.-C. 
Cuillandre & G. Anselmi) 
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observed distribution of galaxies within a cluster 
with an expression of the form (19.8). In this way 
a characteristic cluster radius of about 2-5 Mpc is 
obtained. The number of members depends both 
on the cluster radius and on the limiting magni¬ 
tude. A large cluster may contain several hundred 
galaxies that are less than two magnitudes fainter 
than the characteristic luminosity L* of (19.2). 

Clusters of galaxies can be ordered in a se¬ 
quence from extended, low-density, irregular sys¬ 
tems (sometimes called clouds of galaxies) to 
denser and more regular structures (Fig. 19.23). 
The galaxy type composition also varies along 
this sequence in the sense that in the loose irregu¬ 
lar clusters, the bright galaxies are predominantly 
spirals, whereas the members of dense clusters 
are almost exclusively type E and SO. The nearest 
cluster of galaxies is the Virgo Cluster at a dis¬ 
tance of about 15 Mpc. It is a relatively irregu¬ 
lar cluster, where a denser central region contain¬ 
ing early galaxy types is surrounded by a more 
extended distribution of mainly spiral galaxies. 
The nearest regular cluster is the Coma Cluster, 



Fig. 19.22 A theoretical X-ray spectrum for plasma with 
a temperature of 10 8 K. Most of the radiation comes 
from the bremsstrahlung of electrons, peaking at 8.6 kilo- 
electronvolts (thick line). Superimposed are spectral lines 
produced by different ions. (Bohringer & Hensler, 1989, 
A&A215, 147) 

roughly 90 Mpc away. In the Coma Cluster a cen¬ 
tral pair of giant ellipticals is surrounded by a flat- 
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Fig. 19.23 Above: The 
irregular Virgo Cluster of 
galaxies. Below: The 
regular Coma Cluster (ESO 
and Karl-Schwarzschild- 
Observatorium) 



tened (axis ratio about 2:1) system of early type 
galaxies. 

Earlier galaxy groups were studied as con¬ 
glomerates of visible galaxies. Recently two 
other, more massive, components have been in¬ 
cluded. 

First of all, galaxy clusters are surrounded 
by large clouds of hot gas, as mentioned in 
Sect. 19.2. The gas can be observed by its X- 


ray emission. The spectrum shows features typ¬ 
ical for bremsstrahlung of electrons and lines of 
totally ionised metals (like FeXXIV). Tempera¬ 
ture measured from X-ray spectra are 10 7 —10 s K. 
The total mass of the gas in the poorest clusters 
is about 5 times and in the richest clusters even 
20 times as big as the mass inside the galaxies. 
Part of the gas comes from the galaxies (jets of 
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quasars, supernova explosions), but most of it is 
hydrogen and helium from the big bang. 

Besides in X-ray images the existence of the 
hot gas is seen also as extra spots in the cosmic 
background radiation, caused by the Sunyayev- 
Zel’dovich effect. Photons of the background ra¬ 
diation hit electrons of the hot gas and receive 
some extra energy (inverse Compton scattering). 
Below 218 GHz the radiation is slightly cooler 
and above 218 GHz a little brighter. The diameter 
of the spot is about one arc minute and the tem¬ 
perature difference from the surroundings 0.1-1 
millikelvins (Fig. 19.25). One of the main objec¬ 
tives of the Planck satellite was to find gas clouds 
around galaxy clusters using this phenomenon. 
The final catalogue of Planck (2015) contains 
over 1600 objects most of which are previously 
unknown galaxies with redshifts z = 0 — 1.5. 

Without a strong source of gravitation the hot 
gas would quickly disperse into space. The grav¬ 
itation is caused by dark matter; its mass is about 
five times as big as that of the ordinary matter 
(hot gas + visible galaxies) in the cluster. Grav¬ 
itational lenses have been used to investigate the 
dark matter in galaxy clusters (Sect. 19.7). 

Superclusters Groups and clusters of galaxies 
may form even larger systems, superclusters. For 
example, the Local Group belongs to the Lo¬ 
cal Supercluster , a flattened system whose centre 
is the Virgo Cluster, containing tens of smaller 
groups and clouds of galaxies. The Coma Clus¬ 
ter is part of another supercluster. The diame¬ 
ters of superclusters are 10-20 Mpc. However, 
on this scale, it is no longer clear whether one 
can reasonably speak of individual systems. Per¬ 
haps it would be more accurate to think of the 
distribution of galaxies as a continuous network, 
where the large clusters are connected by walls 
and strings formed by smaller systems. Between 
these there remain empty regions containing very 
few galaxies, which can be up to 50 Mpc in di¬ 
ameter (Figs. 19.24, 20.8). 


19.7 Active Galaxies and Quasars 

So far in this chapter we have been concerned 
with the properties of normal galaxies. In some 



Fig. 19.24 The large-scale space distribution of 245,591 
galaxies. The radial co-ordinate is the redshift, which can 
be translated into a distance using a value for the Hub¬ 
ble constant. The thickness of the slices is about 10°. 
(2dFGRS Team, http://www2.aao.gov.au/~TDFgg/) 

galaxies, however, the normal galaxy is overshad¬ 
owed by violent activity. This activity is produced 
in the nucleus, which is then called an active 
galactic nucleus (AGN). 

The luminosities of active galactic nuclei may 
be extremely large, sometimes much larger than 
that of the rest of the galaxy. It seems unlikely 
that a galaxy could maintain such a large power 
output for long. For this reason it is thought that 
active galaxies do not form a separate class of 
galaxies, but rather represent a passing stage in 
the evolution of normal galaxies. 

Activity appears in many different forms. 
Some galaxies have an exceptionally bright nu¬ 
cleus similar to a large region of ionised hydro¬ 
gen. These may be young galaxies, where near 
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S/N 


- 7.9 - 2.9 2.1 7.1 12.1 17.1 22.1 



316.7 316.6 316.5 316.4 


Fig. 19.25 A brightening of the cosmic background ra¬ 
diation caused by the Sunyayev-Zel’dovits effect. The ra¬ 
dio observation in the millimetre band were made with 
the South pole telescope. The amplitude of the brightening 
tells that the temperature of the gas cloud is 1.2 x 10 8 K 
and the total mass of the galaxy cluster about 10 15 solar 
masses. Optical images of the same location show a dis¬ 
tant galaxy cluster SPT-CL J2106-5844, which has a red- 
shift z = 1.13. (Foley, Andersson, Bazin, de Haan, Ruel et 
al„ 2011, ApJ 731, p. 89) 

the centre large numbers of stars are forming and 
evolving into supernovae (starburst nuclei). In 
other nuclei the radiation cannot have been pro¬ 
duced by stars, and the most plausible source of 
energy in these nuclei is the gravitational energy 
of a supermassive black hole (mass > 10 8 M q ). 
In some galaxies, the spectral lines are unusually 
broad, indicating large internal velocities. These 
may be either rotational velocities near a black 
hole or due to explosive events in the nucleus. 
In some galaxies, jets are seen coming out of the 
nucleus. Many active galaxies radiate a nonther- 
mal spectrum, apparently synchrotron radiation 
produced by fast electrons in a magnetic field. 

The classification of active galaxies has been 
developed rather unsystematically, since many 
of them have been discovered only recently, and 
have not been completely studied. For example, 
the Markarian galaxies catalogued by Benyamin 
Yerishevich Markarian in the early 1970’s are 


defined by strong ultraviolet emission. Many 
Markarian galaxies are Seyfert galaxies; others 
are galaxies undergoing a burst of star formation. 
The N galaxies form another class closely similar 
to the Seyfert galaxies. 

Two natural basic classes of active galaxies are 
the Seyfert galaxies and the radio galaxies. The 
former are spirals; the latter are ellipticals. Some 
astronomers think that the Seyfert galaxies repre¬ 
sent the active stage of normal spiral galaxies and 
the radio galaxies that of ellipticals. 

Seyfert Galaxies The Seyfert galaxies are 
named after Carl Seyfert , who discovered them 
in 1943. Their most important characteristics are 
a bright, pointlike central nucleus and a spectrum 
showing broad emission lines. The continuous 
spectrum has a nonthermal component, which is 
most prominent in the ultraviolet. The emission 
lines are thought to be produced in gas clouds 
moving close to the nucleus with large velocities. 

On the basis of the spectrum, Seyfert galaxies 
are classified as type 1 or 2. In a type 1 spec¬ 
trum, the allowed lines are broad (correspond¬ 
ing to a velocity of 10 4 kms -1 ), much broader 
than the forbidden lines. In type 2, all lines are 
similar and narrower (<10 3 kms -1 ). Transitions 
between these types and intermediate cases have 
sometimes been observed. The reason for the dif¬ 
ference is thought to be that the allowed lines are 
formed in denser gas near the nucleus, and the 
forbidden lines in more diffuse gas further out. In 
type 2 Seyfert galaxies, the denser gas is missing 
or obscured. 

Almost all Seyfert galaxies with known Hub¬ 
ble types are spirals; the possible exceptions 
are of type 2. They are strong infrared sources. 
Type 1 galaxies often show strong X-ray emis¬ 
sion. 

The true Seyfert galaxies are relatively weak 
radio sources. However, there are compact ra¬ 
dio galaxies with an optical spectrum that is es¬ 
sentially the same as for Seyfert galaxies. These 
should probably be classified with the Seyfert 
galaxies. In general, the stronger radio emission 
seems to come with a type 2 spectrum. 

It is estimated that about 1 % of all bright spi¬ 
ral galaxies are Seyfert galaxies. The luminosi¬ 
ties of their nuclei are about 10 36 —10 39 W, of the 
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same order as all the rest of the galaxy. Brightness 
variations are common. 

Radio Galaxies By definition, radio galaxies 
are galaxies that are powerful radio sources. The 
radio emission of a radio galaxy is non-thermal 
synchrotron radiation. The radio luminosity of ra¬ 
dio galaxies is typically 10 33 -10 38 W, and may 
thus be as large as the total luminosity of a nor¬ 
mal galaxy. The main problem in explaining ra¬ 
dio emission is to understand how the electrons 
and magnetic fields are produced, and above all, 
where the electrons get their energy. 

The forms and sizes of the radio emitting re¬ 
gions of radio galaxies have been studied ever 
since the 1950’s, when radio interferometers 
achieved the resolution of optical telescopes. The 
characteristic feature of a strong radio galaxy is 
a double structure: there are two large radio emit¬ 
ting regions on opposite sides of the observed 
galaxy. The radio emitting regions of some ra¬ 
dio galaxies are as far apart as 6 Mpc, almost ten 
times the distance between the Milky Way and 
Andromeda galaxies. One of the smallest dou¬ 
ble radio sources is the galaxy M87 (Fig. 19.26), 
whose two components are only a few kpc distant 
from each other. 

The double structure of radio galaxies appears 
to be produced by ejections from the nucleus. 
However, the electrons in the radio lobes can¬ 
not be coming from the centre of the galaxy, be¬ 
cause they would lose all their energy during such 
a long transit. Therefore electrons have to be con¬ 
tinuously accelerated within the radio-emitting 
regions. Within the radio lobes there are almost 
point-like regions, hot spots. These are generally 
symmetrically placed with respect to the nucleus, 
and are apparently consequences of nuclear ejec¬ 
tions. 

“Tailed” radio sources also exist. Their ra¬ 
dio emission mainly comes from one side of the 
galaxy, forming a curved tail, which is often tens 
of times longer than the diameter of the galaxy. 
The best examples are NGC 1265 in the Perseus 
cluster of galaxies and 3C129, which appears 
to be in an elliptical orbit around a companion 
galaxy. The tail is interpreted as the trail left by 
the radio galaxy in intergalactic space. 


Another special feature revealed by the radio 
maps is the presence of jets, narrow lines of ra¬ 
dio emission, usually starting in the nucleus and 
stretching far outside the galaxy. The best known 
may be the M87 jet, which has also been observed 
as an optical and X-ray jet. The optically ob¬ 
served jet is surrounded by a radio source. A simi¬ 
lar radio source is seen on the opposite side of the 
nucleus, where no optical jet is seen. Our near¬ 
est radio galaxy Centaurus A also has a jet ex¬ 
tending from the nucleus to near the edge of the 
galaxy. 

VLBI observations of radio jets have also re¬ 
vealed superluminal motions : in many compact 
sources the components appear to be separating 
faster than the speed of light. Since such veloci¬ 
ties are impossible according to the theory of rel¬ 
ativity, the observed velocities can only be appar¬ 
ent, and several models have been proposed to ac¬ 
count for them. 

Quasars The first quasar was discovered in 
1963, when Maarten Schmidt interpreted the op¬ 
tical emission lines of the known radio source 
3C273 as hydrogen Balmer lines redshifted by 
16 %. Such large redshifts are the most re¬ 
markable characteristics of the quasars. Properly 
speaking, the word quasar is an abbreviation for 
quasistellar radio source, and some astronomers 
prefer to use the designation QSO (quasistellar 
object), since not all quasars emit radio radia¬ 
tion. 

Optically the quasars appear almost as point 
sources, although improved observational tech¬ 
niques have revealed an increasing number of 
quasars located inside more or less normal galax¬ 
ies (Fig. 19.27). Although the first quasars where 
discovered by radio observations, only a small 
fraction of all optically identified quasars are 
bright radio sources. Most radio quasars are point 
sources, but some have a double structure like the 
radio galaxies. Satellite X-ray pictures also show 
the quasars to be pointlike. 

In the visible region the quasar spectra are 
dominated by spectral lines with rest wavelengths 
in the ultraviolet. The first observed quasar red- 
shifts were z = 0.16 and 0.37, and later searches 
have continued to turn up ever larger redshifts. 
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Fig. 19.26 Above'. The active galaxy M87. In the lower 
right-hand comer a short exposure of the core region has 
been inserted (same scale as in the main photograph). 
One sees a blue jet coming out of the nucleus of a nor¬ 
mal EO galaxy (NOAO/Kitt Peak National Observatory). 

The present record is 8.7, but three objects have 
redshift 10. If these are quasars their light must 
have been emitted when the age of the universe 
was only 500 million years. The large inferred 
distances of the quasars mean that their luminosi¬ 
ties have to be extremely large. Typical values lie 
in the range of 10 38 —10 41 W. The brightness of 


Below. In the radio map made using the VLA the jet 
is observed to be two-sided. The area shown is much 
smaller than in the upper picture. (Owen, F.N., Hardee, 
P.E., Bignell, R.C. (1980): Astrophys. J. (Lett.) 239 , LI 1) 


quasars may vary rapidly, within a few days or 
less. Thus the emitting region can be no larger 
than a few light-days, i.e. about 100 au. 

The quasars often have both emission and ab¬ 
sorption lines in their spectra. The emission lines 
are very broad and are probably produced in 
the quasar itself. Much of the absorption spec- 
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Fig. 19.27 One of the nearest quasars, 3C 273, pho¬ 
tographed with two cameras aboard the Hubble Space 
Telescope. On the left, the Wide Field Planetary Camera 
sees a bright point-like source, with a jet blasted out from 
the quasar (towards 5 o’clock). On the right, a corona- 


graph in the Advanced Camera for Surveys blocks out the 
brightest parts of the quasar. Spiral arms in the host galaxy 
can be seen, with dark dust lanes, as well as new details in 
the path of the jet. (Photos Hubble/NASA/ESA) 



Fig. 19.28 A section of the spectrum of the quasar 
Q1422+2309 observed with the Keck 10 metre telescope. 
The Lyman a line of the quasar is near the right edge. Its 
redshift is 3.62. To the left of the line is a Lyman a for- 

trum consists of densely distributed narrow lines 
that are thought to be hydrogen Lyman a lines 
formed in gas clouds along the line of sight to 
the quasar. The clouds producing this “Lyman 
a forest" (Fig. 19.28) are young galaxies or pro¬ 
togalaxies, and they therefore provide important 
evidence about the formation of galaxies. Lyman 
a forests can be used to map intergalactic gas 
clouds up to redshift z = 6 (Fig. 19.29). Fur¬ 


est, where each absorption line corresponds to a hydrogen 
cloud that the radiation had to penetrate. Wavelengths are 
given in angstroms. (Kuva Womble, Sargent, Lyons) 


ther out Lyman a absorption lines cannot be seen 
since most of the hydrogen is neutral. 

Unified Models Although the forms of galac¬ 
tic activity may at first sight appear diverse, they 
can be unified within a fairly widely accepted 
schematic model in such a way that the properties 
of each individual object depend on a few param¬ 
eters only. 
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According to this model, most galaxies con¬ 
tain a compact central nucleus, which is a super- 
massive black hole, with mass 10 7 -10 10 Mq, sur¬ 
rounded by a disk or ring of gas. The source of 
energy is the gravitational energy released as gas 
is accreted into the black hole. The disk may also 
give rise to a jet, where some of the energy is con¬ 
verted into perpendicular motions along the rota¬ 
tional axis. Thus active galactic nuclei are simi¬ 
lar to the nucleus of the Milky Way, although the 
masses of both the black hole and the gas disk 
may be much larger. 

Deducing the mass of the central black hole 
is difficult and uncertain. However, using a vari¬ 
ety of methods involving the motions of stars and 
gas at the centre of nearby galaxies black hole 
masses for about 30 galaxies have been deter¬ 
mined. The most important result of these studies 
is that there is a close relationship between the 
black hole mass and the central velocity disper¬ 
sion of the galaxy. According to the virial theo¬ 
rem the velocity dispersion is a measure of the 
bulge mass, and therefore the conclusion is that 
there is a close relationship between the mass of 
the bulge and the mass of the central black hole. 

The first characteristic parameter of the uni¬ 
fied model is obviously the total luminosity. For 



Fig. 19.29 Simulation of hydrogen distribution in space. 
The light emitted by a quasar goes through different gas 
clouds creating a Lyman a forest. (Renyue Cen, Princeton 
University) 


example, the only essential difference between 
Seyfert 1 galaxies and radio-quiet quasars is the 
larger luminosity of quasars. Another basic pa¬ 
rameter is the radio brightness, which may be re¬ 
lated to the strength of a jet. On the basis of their 
radio luminosity one can connect Seyfert galaxies 
and radio-quiet quasars on one hand, and radio 
galaxies and radio quasars on the other. 

The third important parameter of unified mod¬ 
els is the angle from which we happen to view 
the nuclear disk. For example, if the disk is seen 
edge-on, the actual nucleus is obscured by the 
disk. This could explain the difference between 
Seyfert types 1 and 2: in type 2 we do not see 
the broad emission lines formed near the black 
hole, but only the narrower lines from the disk. 
Similarly a galaxy that looks like a double radio 
source when seen edge-on, would look like a ra¬ 
dio quasar if the disk were seen face-on. In the 
latter case there is a possibility that we may be 
seeing an object directly along the jet. It will then 
appear as a blazar, an object with rapid and vio¬ 
lent variations in brightness and polarisation, and 
very weak or invisible emission lines. If the jet 
is almost relativistic, its transverse velocity may 
appear larger than the speed of light, and thus su¬ 
perluminal motions can also be understood. 

One prediction of the unified model is that 
there should be a large number of quasars where 
the nucleus is obscured by the disk as in the 
Seyfert 2 galaxies. In analogy with the Seyfert 
galaxies these objects are referred to as type 2 
AGN or quasars. Because of the obscuration 
such sources would not be included in surveys 
at optical, UV, or soft X-ray wavelengths. In 
hard X-rays the obscuration is less, and in the 
far infrared the absorbed energy is re-radiated. 
Searches for type 2 quasars have been made at 
these wavelengths with the Chandra X-ray satel¬ 
lite and with the Spitzer Space Telescope. The 
indications from these searches are that at least 
3/4 of all supermassive black holes are heavily 
obscured. 

Gravitational Lenses An interesting pheno¬ 
menon first discovered in connection with quasars 
are gravitational lenses. Since light rays are bent 
by gravitational fields, a mass (e.g. a galaxy) 
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Fig. 19.30 (a) The 

components of the Einstein 
cross are gravitationally 
lensed images of the same 
quasar (ESA/NASA). 

(b) The massive galaxy 
cluster Abell 2218 deflects 
light rays passing through 
it and acts as a giant 
gravitational lens. In this 
Hubble picture from 
January 2000, dozens of 
arc-shaped images of 
distant galaxies can be 
seen. (Photo A. Fruchter, 

S. Baggett, R. Hook and 
Z. Levay, NASA/STScI) 



placed between a distant quasar and the observer 
will distort the image of the quasar. The first ex¬ 
ample of this effect was discovered in 1979, when 
it was found that two quasars, 5.7" apart in the 
sky, had essentially identical spectra. It was con¬ 
cluded that the “pair” was really a double image 
of a single quasar. Since then several other grav¬ 
itationally lensed quasars have been discovered 
(Fig. 19.30). 


Gravitational lenses have also been discov¬ 
ered in clusters of galaxies. Here the gravitational 
field of the cluster distorts the images of dis¬ 
tant galaxies into arcs around the cluster centre. 
Currently some two hundred individual galaxies 
and twenty galaxy clusters are known to pro¬ 
duce lensed images of background galaxies or 
quasars. Such systems are called strong gravita¬ 
tional lenses (Fig. 19.31). 
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Fig. 19.31 A strong gravitational lens. The light of a quasar is bent by the gravity of a lensing galaxy. The quasar 
appears as two images on both sides of the lensing galaxy 




Flattened 

image 

Actual 

shape 


Fig. 19.32 A weak gravitational lens. Light rays coming from the upper and lower edges of a distant galaxy are bent 
by different amounts. The image of the galaxy becomes slightly flattened 


Recently emphasis has moved towards weak 
gravitational lenses , which are much more com¬ 
mon (Fig. 19.32). The lens does not produce sep¬ 
arate images or arcs from a background but only 
flattens its shape. Since the axial ratio changes 
only by 1-2 %, the weak lenses cannot be used 
to study shapes of individual galaxies because 
their original shapes are not known. However, if 
hundreds or thousands of galaxies around a mass 
concentration are measured, statistical analysis 
shows the flattening. In the 2010’s weak lenses 
are routinely used to observe the distribution of 
the invisible matters. 

Figure 19.13 showed an example about dark 
matter clouds around a galaxy cluster. 


19.8 The Origin and Evolution of 
Galaxies 

Because the speed of light is finite we see distant 
galaxies at earlier stages in their life. In the next 
chapter we show how the age of a galaxy with 
given redshift can be calculated based on the rate 
of expansion of the Universe. However, this rela¬ 
tionship will depend on the cosmological model. 


which therefore always has to be specified when 
studying the evolution of galaxies. 

Evolution of galaxies began when the first 
stars were born at redshifts 10-20. The universe 
consisted of neutral hydrogen and helium only, 
and the temperature of the gas had dropped be¬ 
low 100 kelvins. The first stars were very mas¬ 
sive and shortlived. Radiation from the first stars, 
galaxies and quasars rapidly ionised the interstel¬ 
lar and intergalactic gas. This era is called the 
time of ionisation. Ionisation was almost com¬ 
plete when the redshift reached z — 6, and ever 
since 99.9 % of the hydrogen has been ionised. 
Towards the end of the reionisation, at redshifts 
6-15, massive galaxies (M > 10 10 Mq) were al¬ 
ready born. Current observations of galaxies go 
back to redshifts 10-12. 

According to the currently widely accepted 
cosmological models most of the matter in the 
Universe is in a form that emits no radiation, 
and is only observable from its gravitational ef¬ 
fects. In this Cold Dark Matter (CDM) theory 
(see Sect. 20.7) the first systems to collapse and 
start forming stars were small, with masses like 
those of dwarf galaxies. Larger galaxies were 
formed later as these smaller fragments collected 
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into larger clumps. This model, where most stars 
are formed in small galaxies is usually described 
as the hierarchical model. 

A large part of our theoretical ideas on galaxy 
evolution is based on numerical simulations of 
the collapse of gas clouds and star formation in 
them. Using some prescription for star formation 
one can try to compute the evolution of the spec¬ 
tral energy distribution and the chemical abun¬ 
dances in the resulting galaxies. The results of 
the models can be compared with the observa¬ 
tional data presented in the previous sections of 
this chapter. 

The density distribution of dark matter is ex¬ 
pected to be very irregular, containing numerous 
small-scale clumps. The collapse will therefore 
be highly inhomogeneous, both in the hierarchi¬ 
cal and in the monolithic picture, and subsequent 
mergers between smaller systems should be com¬ 
mon. There are additional complicating factors. 
Gas may be expelled from the galaxy, or there 
may be an influx of fresh gas. Interactions with 
the surroundings may radically alter the course of 
evolution—in dense systems they may lead to the 
complete merging of the individual galaxies into 
one giant elliptical. Much remains to be learned 
about how the formation of stars is affected by 
the general dynamical state of the galaxy and of 
how an active nucleus may influence the forma¬ 
tion process. 

Our observational knowledge of galaxy evo¬ 
lution is advancing very rapidly. Essentially all 
the relationships described earlier in this chap¬ 
ter have been studied as functions of time. Still, 
a complete generally accepted description of the 
way the Universe reached its present state has not 
yet been established. Here we can only mention 
a few of the most central aspects of the processes 
leading to the galaxies we observe to-day. 

Density and Luminosity Evolution The most 
basic way of studying the formation and evolu¬ 
tion of galaxies is by counting their numbers, ei¬ 
ther the number brighter than some given magni¬ 
tude limit (as was already done by Hubble in the 
1930’s, see Sect. 20.1) or else the number den¬ 
sity as a function of redshift. The counts can be 
compared with the numbers expected if there is 
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Fig. 19.33 Galaxy counts in the U, B, I, and K wave¬ 
length bands. The counts are compared to those in a cos¬ 
mological model without evolution. The cosmological pa¬ 
rameters correspond to the currently preferred “concor¬ 
dance” model (see Sect. 19.5). (H.C. Ferguson et al., 2000, 
ARAA 38, 667, Fig. 4) 



Fig. 19.34 The deepest image taken by the Hubble space 
telescope and published in 2014. The total exposure time 
is 27 days. The original picture contains about 15,000 faint 
galaxies. (NASA, ESA, H. Teplitz, M. Rafelski, A. Koeke- 
moer, R. Windhorst and Z. Levay) 

no evolution, which depend on the cosmological 
model. They can therefore be used either as a cos- 
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Fig. 19.35 Different early 
forms of galaxies in the 
redshift range z = 2-4. Top 
line: chain galaxies; second 
line: clump-clusters; third 
line: double galaxies; 
fourth line: tadpoles; fifth 
line: spiral galaxies being 
born; bottom line: elliptic 
galaxies. (Elmegreen, 
Elmegreen, Rubin, 

Schaffer, 2005, ApJ 631, 
p. 87) 
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mological test, or as a test of evolutionary mod¬ 
els. However, the present situation is that more 
reliable cosmological tests are available, and the 
number counts are mainly used to study the evo¬ 
lution of galaxies. 

There are two ways in which the number 
counts are affected by galaxy evolution. In den¬ 
sity evolution the actual number of galaxies is 
changing, whereas in luminosity evolution only 
the luminosity of individual galaxies is evolv¬ 
ing. The simplest form of luminosity evolution 


is called passive luminosity evolution , and is due 
to the changing luminosity of stars during nor¬ 
mal stellar evolution. Pure luminosity evolution 
is expected to be predominant in the monolithic 
picture. In the hierarchical picture density evolu¬ 
tion will be more prominent, since in this picture 
smaller galaxies are to a greater extent being de¬ 
stroyed to produce larger more luminous ones. 

Figure 19.33 gives an example of the results of 
number counts. A model without evolution can¬ 
not explain these observed counts, and various 
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models incorporating evolutionary effects have to 
be introduced. However, a unique model cannot 
be determined using just the number counts. 

Distant Galaxies A more direct approach to 
galaxy formation is the direct search for the most 
distant objects visible. In Fig. 19.34 we show the 
Hubble Ultra Deep Field (HUDF), an area of the 
sky observed with the Hubble Space Telescope. 
The total exposure time through different filters 
was 27 days. The image contains over 15,000 
galaxies. Over 5000 are seen as dots; their red- 
shift is aver 6. The other ones, seen as extended 
objects, give information on galaxy evolution at 
redshifts 1-5. 

The most important observation is that with in¬ 
creasing redshift the Hubble classification breaks 
down and finally disappears. Around redshift 
z = 3 the spiral galaxies resembling the current 
ones have disappeared almost completely. In¬ 
stead, there are different irregular and strange 
galaxies. 

Figure 19.35 shows examples of distant galax¬ 
ies in the range z = 2-4. Over 10 % are “chain 
galaxies”, about 20 % “clump clusters”, 15 % 
“doubles” and 10 % “tadpoles”. About 30 % are 
evolving spirals and 10 % ellipticals. 

Another important observation is that in the 
past galaxies were smaller than nowadays. Even 
if the mass or luminosity of a galaxy were sim¬ 
ilar to a modern galaxy its diameter at z = 3 is 
only one tenth. The most probable explanation is 
that the outer parts grow in several small mergers. 
Collision of a dwarf galaxy does not considerably 
increase the mass of the great galaxy but brings 
more stars to its outskirts increasing its diameter. 
The observation clearly indicates how important 
mutual collisions have been in the evolution of 
galaxies towards their current forms. 

Evolution of AGN The first clear indication of 
cosmic evolution was in the numbers of radio 
galaxies and quasars. Already in the late 1960’s 
it was becoming clear that the density of quasars 
was increasing dramatically towards higher red- 
shifts (Fig. 19.36). Roughly, the number density 
of quasars increases relative to the present den¬ 
sity by a factor 100 out to a broad maximum at 
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Fig. 19.36 Quasar activity at different redshifts accord¬ 
ing to X-ray and infrared observations (gray areas and the 
gray line). The black line corresponds to the stellar birth 
rate in Fig. 19.37. (Madau ja Dickinson, 2014. Ann. Rev. 
Astron. Astrophys. 52) 
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Fig. 19.37 Stellar birth rates at different redshifts. Dif¬ 
ferent symbols correspond to values obtained in different 
studies and their standard errors. (Madau and Dickinson. 
2014, Ann. Rev. Astron. Astrophys. 52) 

redshift about 2. The observed behaviour may be 
due to either density or luminosity evolution. The 
density of radio galaxies also has a maximum 
at redshifts about 1.5-3, which is sometimes re¬ 
ferred to as the quasar era. 

The Star Formation History of the Universe 

Since the Universe contained only neutral gas as 
the first stars began to form, the most general de¬ 
scription of how galaxies came to be is in terms 
of the rate at which the gas is being turned into 
stars. The star formation history going back to 
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a redshift about 8 is shown in Fig. 19.37. The star 
formation rate was about an order of magnitude 
larger than its current value at redshift 1-2. For 
even larger redshifts it seems to have remained 
nearly constant or decreased slowly. 


19.9 Exercises 

Exercise 19.1 The galaxy NGC 772 is an Sb spi¬ 
ral, similar to M31. Its angular diameter is T and 
apparent magnitude 12.0. The corresponding val¬ 
ues of M31 are 3.0° and 5.0. Find the ratio of the 
distances of the galaxies 


(a) assuming their sizes are equal, 

(b) assuming they are equally bright. 

Exercise 19.2 The brightness of the quasar 
3C279 has shown changes with the time scale of 
one week. Estimate the size of the region produc¬ 
ing the radiation. The apparent magnitude is 18. 
If the distance of the quasar is 2000 Mpc, what 
is its absolute magnitude and luminosity? How 
much energy is produced per au 3 7 
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Cosmology is a science studying the structure and 
evolution of the whole Universe. Research in cos¬ 
mology tries to answer questions such as: How 
large and how old is the Universe? How was it 
born? How is matter distributed? How were the 
elements formed? What will be the future of the 
Universe? 

After the demise of the Aristotelian world pic¬ 
ture, it took hundreds of years of astronomical ob¬ 
servations and physical theories to reach a level at 
which a satisfactory modern scientific picture of 
the physical universe could be formed. The de¬ 
cisive steps in the development were the clarifi¬ 
cation of the nature of the galaxies in the 1920’s 
and the general theory of relativity, which gives a 
basis for theoretical research of the universe, de¬ 
veloped by Einstein in the 1910’s. Yet, just a few 
decades ago there was a lot of room for armwaiv- 
ing and all kinds of speculations. Only recently 
the observations made with satellites have given 
reasonably precise values for some parameters 
and some alternative models could be excluded. 
The central tenet of modern cosmology is the 
model of the expanding universe, getting all the 
time more supporting observational evidence. 


20.1 Cosmological Observations 

The Olbers Paradox The simplest cosmolog¬ 
ical observation may be that the sky is dark at 
night. This fact was first noted by Johannes Ke¬ 
pler , who, in 1610, used it as evidence for a finite 
universe. As the idea of an infinite space filled 


with stars like the Sun became widespread in con¬ 
sequence of the Copernican revolution, the ques¬ 
tion of the dark night sky remained a problem. In 
the 18th and 19th centuries Edmond Halley, Loys 
de Cheseaux and Heinrich Olbers considered it in 
their writings. It has become known as the Olbers 
paradox (Fig. 20.1). 

The paradox is the following: Let us suppose 
the Universe is infinite and that the stars are uni¬ 
formly distributed in space. No matter in what di¬ 
rection one looks, sooner or later the line of sight 
will encounter the surface of a star. Since the sur¬ 
face brightness does not depend on distance, each 
point in the sky should appear to be as bright as 
the surface of the Sun. This clearly is not true. 
The modern explanation of the paradox is that the 
stars have only existed for a finite time, so that the 
light from very distant stars has not yet reached 
us. Rather than proving the world to be finite in 
space, the Olbers paradox has shown it to be of 
a finite age. 

Extragalactic Space In 1923 Edwin Hubble 
showed that the Andromeda Galaxy M31 was 
far outside the Milky Way, thus settling a long¬ 
standing controversy concerning the relationship 
between the nebulae and the Milky Way. The 
numerous galaxies seen in photographs form 
an extragalactic space vastly larger than the di¬ 
mensions of the Milky Way. It is important for 
cosmology that the distribution and motions of 
the basic components of extragalactic space, the 
galaxies and clusters of galaxies, should every¬ 
where be the same as in our local part of the 
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Fig. 20.1 The Olbers paradox. If the stars were uniformly 
distributed in an unending, unchanging space, the sky 
should be as bright as the surface of the Sun, since each 
line of sight would eventually meet the surface of a star. 

Universe. Galaxies generally occur in various 
systems, ranging from small groups to clusters 
of galaxies and even larger superclusters. The 
largest structures observed are about 100 Mpc in 
size (see Sect. 19.6). They are thus significantly 
smaller than the volume of space (a few thousand 
Mpc in size) in which the distribution of galaxies 
has been investigated. One way of studying the 
large-scale homogeneity of the galaxy distribu¬ 
tion is to count the number of galaxies brighter 
than some limiting magnitude m. If the galax¬ 
ies are uniformly distributed in space, this num¬ 
ber should be proportional to 10 06m (see Exam¬ 
ple 18.1). For example, the galaxy counts made 
by Hubble in 1934, which included 44,000 galax¬ 
ies, were consistent with a galaxy distribution in¬ 
dependent of position (homogeneity) and of di¬ 
rection (isotropy). Hubble found no “edge” of the 
Universe, nor have later galaxy counts found one. 

Similar counts have been made for extragalac- 
tic radio sources. (Instead of magnitudes, flux 
densities are used. If F is the flux density, then 
because m = — 2.5 lg(F/Fo), the number count 
will be proportional to F ~ 3 ^ 2 .) These counts 
mainly involve very distant radio galaxies and 
quasars (Fig. 20.2). The results seem to indicate 
that the radio sources were either much brighter 
or much more common at earlier epochs than at 
present (Sect. 19.8). This constitutes evidence in 
favour of an evolving, expanding universe. 

In general the simple geometric relation be¬ 
tween brightness and number counts will only 


A two-dimensional analogy can be found in an optically 
thick pine forest where the line of sight meets a trunk 
wherever one looks. (Photo M. Poutanen and H. Kart- 
tunen) 

hold for objects that are uniformly distributed in 
space. Focal inhomogeneities will cause depar¬ 
tures from the expected relationship. For more 
distant sources the geometry of the Universe as 
well as cosmic evolution will change the basic 
10 °- 6 "( behaviour. 

Hubble’s Law (See Fig. 20.3.) In the late 
1920’s, Hubble discovered that the spectral lines 
of galaxies were shifted towards the red by an 
amount proportional to their distances. If the red- 
shift is due to the Doppler effect, this means that 
the galaxies move away from each other with ve¬ 
locities proportional to their separations, i.e. that 
the Universe is expanding as a whole. 

In terms of the redshift z = (A. — Xq)/Xq, Hub¬ 
ble’s law can be written as 

z = (H/c)r , (20.1) 

where c is the speed of light, H is the Hub¬ 
ble constant and r the distance of the galaxy. 
For small velocities (V c) the Doppler redshift 
z = V/c, and hence 

V = Hr, (20.2) 

which is the most commonly used form of Hub¬ 
ble’s law. 

For a set of observed “standard candles”, i.e. 
galaxies whose absolute magnitudes are close 
to some mean Mo, Hubble’s law corresponds to 
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Fig. 20.2 The quasar 
3C295 and its spectrum. 
The quasars are among the 
most distant cosmological 
objects. (Photograph 
Palomar Observatory) 



a linear relationship between the apparent mag¬ 
nitude m and the logarithm of the redshift, Ig z. 
This is because a galaxy at distance r has an ap¬ 
parent magnitude m = Mq + 5 lg(r/10 pc), and 
hence Hubble’s law yields 

m = M 0 + 51g( CZ —) = 51gz + C, 
\H x 10 pc/ 

(20.3) 

where the constant C depends on H and Mq. 
Suitable standard candles are e.g. the brightest 
galaxies in clusters and Sc galaxies of a known 
luminosity class. Some other methods of dis¬ 
tance determination for galaxies were discussed 
in Sect. 19.2. Most recently, type la supernovae 
(Sect. 14.4) in distant galaxies have been used to 
determine distances out to the redshift z — 1.7, 


where departures from Hubble’s law are already 
detectable. 

If the Universe is expanding, the galaxies were 
once much nearer to each other. If the rate of ex¬ 
pansion had been unchanging, the inverse of the 
Hubble constant, T = H~ l , would represent the 
age of the Universe. If the expansion is gradually 
slowing down, the inverse Hubble constant gives 
an upper limit on the age of the Universe In fact, 
current indications (discussed later in this chap¬ 
ter) are that the rate of expansion is accelerating 
at present. In that case the age of the Universe 
may also be larger. However, H~ l will still be an 
estimate for the age of the Universe. 

One reason for the difficulty in determining 
the value of the Hubble constant is the uncer- 
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Fig. 20.3 Hubble’s law 
for type la supernovae. The 
solid curve represents the 
“concordance” model, with 
basic parameters having 
the values mentioned in 
Sect. 20.7. Models with a 
zero cosmological constant 
cannot explain the 
observations. (D. Rubin et 
al„ 2013, ApJ 763, 35) 



0.8 1.0 
Redshift 


tainty in extragalactic distances. A second prob¬ 
lem is that the measured values of the velocity V , 
corrected to take into account the motion of the 
Sun within the Local Group, contain a signifi¬ 
cant component due to the peculiar motions of the 
galaxies. These peculiar velocities are caused by 
local mass concentrations like groups and clus¬ 
ters of galaxies. It is possible that the Local Group 
has a significant velocity towards the centre of the 
Local Supercluster (the Virgo Cluster). Because 
the Virgo Cluster is often used to determine the 
value of H, neglecting this peculiar velocity leads 
to a large error in H. The size of the peculiar ve¬ 
locity is not yet well known, but it is probably 
about 250 kms -1 . 

The most ambitious recent project for deter¬ 
mining H used the Hubble Space Telescope in 
order to measure cepheid distances to a set of 
nearby galaxies. These distances were then used 
to calibrate other distance indicators, such as the 
Tully-Fisher relation and type la supernovae. The 
final result was H — (72 ± 8) kms -1 Mpc -1 . 
The largest remaining source of error in this result 
is the distance to the Large Magellanic Cloud, 
used for calibrating the cepheid luminosity. 

Nowadays the most accurate value for the 
Hubble constant is obtained by studying the 
angular power spectrum of the background 
radiation (Sect. 20.7). According to the mea- 



Fig. 20.4 It is nowadays assumed that at the beginning 
the expansion rate of the Universe was slowing down un¬ 
til about 5 billion years ago the deceleration changed to 
acceleration. The age of the Universe calculated from the 
current value of the Hubble constant is bigger than the ac¬ 
tual age 

surements of the Planck satellite H = (67.7 ± 
0.5) km s -1 Mpc 1 and the age of the Universe is 
13.80 ±0.02 Ga. 

The form of Hubble’s law might give the im¬ 
pression that the Milky Way is the centre of 
the expansion, in apparent contradiction with the 
Copernican principle. Figure 20.5 shows that, in 
fact, the same Hubble’s law is valid at each point 
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Fig. 20.5 A regular expansion according with Hubble’s 
law does not mean that the Milky Way (O) is the centre of 
the Universe. Observers at any other galaxy (O') will see 
the same Hubble flow (dashed lines) 

in a regularly expanding universe. There is no 
particular centre of expansion. 

The Thermal Microwave Background Radia¬ 
tion The most important cosmological discov¬ 
ery since Hubble’s law was made in 1965. In that 
year Arno Penzias and Robert Wilson discovered 
that there is a universal microwave radiation, with 
a spectrum corresponding to that of blackbody ra¬ 
diation (see Sect. 5.6) at a temperature of about 
3 K (Fig. 20.7). For their discovery, they received 
the Nobel prize in physics in 1979. 

The existence of a thermal cosmic radiation 
background had been predicted in the late 1940's 
by George Gamow , who was one of the first 
to study the initial phases of expansion of the 
Universe. According to Gamow, the Universe at 
that time was filled with extremely hot radia¬ 
tion. As it expanded, the radiation cooled, until at 
present, its temperature would be a few kelvins. 
After its discovery by Penzias and Wilson, the 
cosmic background radiation has been studied at 
wavelengths from 50 cm to 0.5 cm. The first de¬ 
tailed measurements, made from the COBE (Cos¬ 
mic Background Explorer) satellite showed that 
it corresponds closely to a Planck spectrum at 
(2.725 ± 0.002) K (Fig. 20.6). More recently the 
CMB has been mapped in even greater detail by 
the WMAP satellite. 

The existence of the thermal cosmic mi¬ 
crowave background (CMB) gives strong support 
to the belief that the Universe was extremely hot 
in its early stages. The background is very nearly 
isotropic, which supports the isotropic and ho¬ 
mogeneous models of the Universe. The COBE 



Fig. 20.6 Observations of the cosmic microwave back¬ 
ground radiation made by the COBE satellite in 1990 are 
in agreement with a blackbody law at 2.7 K 

and WMAP satellites have also detected temper¬ 
ature variations of a relative amplitude 6 x 10 -6 
in the background. These fluctuations are inter¬ 
preted as a gravitational redshift of the back¬ 
ground produced by the mass concentrations that 
would later give rise to the observed structures 
in the Universe. They are the direct traces of ini¬ 
tial irregularities in the big bang, and provide im¬ 
portant constraints for theories of galaxy forma¬ 
tion. Perhaps even more importantly, the ampli¬ 
tude of the fluctuations on different angular scales 
have provided crucial constraints on the cosmo¬ 
logical model. We shall return to this question in 
Sect. 20.7. 

The Isotropy of Matter and Radiation Apart 
from the CMB, several other phenomena con¬ 
firm the isotropy of the Universe. The distribu¬ 
tion of radio sources, the X-ray background, and 
faint distant galaxies, as well as Hubble’s law are 
all isotropic. The observed isotropy is also evi¬ 
dence that the Universe is homogeneous, since 
a large-scale inhomogeneity would be seen as an 
anisotropy. 

The Age of the Universe Estimates of the ages 
of the Earth, the Sun and of star clusters are im¬ 
portant cosmological observations that do not de¬ 
pend on specific cosmological models. From the 
decay of radioactive isotopes, the age of the Earth 
is estimated to be 4600 million years. The age of 
the Sun is thought to be slightly larger than this. 
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Fig. 20.7 The 

cosmological principle. In 
the small circle (A) about 
the observer (O) the 
distribution of galaxies 
does not yet represent the 
large-scale distribution. In 
the larger circle (B) the 
distribution is already 
uniform on the average 



The ages of the oldest star clusters in the Milky 
Way are 10-15 Ga. 

The values thus obtained give a lower limit to 
the age of the Universe. In an expanding universe, 
the inverse Hubble constant gives another esti¬ 
mate of that age. It is most remarkable that the 
directly determined ages of cosmic objects are so 
close to the age given by the Hubble constant. 
This is strong evidence that Hubble’s law really 
is due to the expansion of the Universe. It also 
shows that the oldest star clusters formed very 
early in the history of the Universe. 

The Relative Helium Abundance A cosmo¬ 
logical theory should also give an acceptable ac¬ 
count of the origin and abundances of the ele¬ 
ments. Even the abundance of the elementary par¬ 
ticles and the lack of antimatter are cosmological 
problems that have begun to be investigated in the 
context of theories of the early Universe. 

Observations show that the oldest objects con¬ 
tain about 25 % by mass of helium, the most 
abundant element after hydrogen. The amount of 
helium produced is sensitive to the temperature of 
the Universe, which is related to that of the back¬ 
ground radiation. The computations made for the 
standard models of the expanding Universe (the 
Friedmann models) yield a helium abundance of 
exactly the right size. 


20.2 The Cosmological Principle 

One hopes that as ever larger volumes of the Uni¬ 
verse are observed, its average properties will be¬ 
come simple and well defined. Figure 20.7 at¬ 
tempts to show this. It shows a distribution of 
galaxies on a plane. As the circle surrounding the 
observer O becomes larger, the mean density in¬ 
side the circle becomes practically independent 
of its size. The same behaviour occurs, regardless 
of the position of the centre of O: at close dis¬ 
tances, the density varies randomly (Fig. 20.8), 
but in a large enough volume, the average den¬ 
sity is constant. This is an example of the cosmo¬ 
logical principle: apart from local irregularities, 
the Universe looks the same from all positions in 
space. 

The cosmological principle is a far-reaching 
assumption, which has been invoked in order to 
put constraints on the large variety of possible 
cosmological theories. If in addition to the cos¬ 
mological principle one also assumes that the 
Universe is isotropic, then the only possible cos¬ 
mic flow is a global expansion. In that case, the 
local velocity difference V between two nearby 
points has to be directly proportional to their sep¬ 
aration (V = Hr ); i.e. Hubble’s law must apply. 

The plane universe of Fig. 20.7 is homoge¬ 
neous and isotropic, apart from local irregular- 
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Fig. 20.8 The galaxies 
seem to be distributed in a 
“foamlike” way. Dense 
strings and shells are 
surrounded by relatively 
empty regions. (Seldner, 
M. et al. (1977): Astron. J. 
82, 249) 



ities. Isotropy at each point implies homogene¬ 
ity, but homogeneity does not require isotropy. 
An example of an anisotropic, homogeneous uni¬ 
verse would be a model containing a constant 
magnetic field: because the field has a fixed di¬ 
rection, space cannot be isotropic. 

We have already seen that astronomical obser¬ 
vations support the homogeneity and isotropy of 
our observable neighbourhood, the metagalaxy. 
On the grounds of the cosmological principle, 
these properties may be extended to the whole of 
the Universe. 

The cosmological principle is closely related 
to the Copernican principle that our position in 
the Universe is in no way special. From this prin¬ 
ciple, it is only a short step to assume that on 
a large enough scale, the local properties of the 
metagalaxy are the same as the global properties 
of the Universe. 

Homogeneity and isotropy are important sim¬ 
plifying assumptions when trying to construct 
cosmological models which can be compared 


with local observations. They may therefore rea¬ 
sonably be adopted, at least as a preliminary hy¬ 
pothesis. 


20.3 Homogeneous and Isotropic 
Universes 


Under general conditions, space and time coordi¬ 
nates in a universe may be chosen so that the val¬ 
ues of the space coordinates of observers moving 
with the matter are constant. It can be shown that 
in a homogeneous and isotropic universe, the line 
element (Appendix B) then takes the form 


d s 2 = -c 2 d t 2 + R 2 (t) 


X 


dr 2 

1 — kr 2 


+ r 2 (d0 2 + cos 2 


d <t> 2 ) , 

(20.4) 


known as the Robertson-Walker line element. 
(The radial coordinate r is defined to be dimen¬ 
sionless.) R(t) is a time-dependent quantity rep¬ 
resenting the scale of the Universe. If R increases 
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Fig. 20.9 The two-dimensional analogues of the Fried¬ 
mann models: a spherical surface, a plane and a saddle 
surface 


with time, all distances, including those between 
galaxies, will grow. The coefficient k may be 
+ 1, 0 or — 1, corresponding to the three possi¬ 
ble geometries of space, the elliptic or closed, the 
parabolic and the hyperbolic or open model. 

The space described by these models need not 
be Euclidean, but can have positive or negative 
curvature. Depending on the curvature, the vol¬ 
ume of the universe may be finite or infinite. In 
neither case does it have a visible edge. 

The two-dimensional analogy to elliptical 
(k — +1) geometry is the surface of a sphere 
(Fig. 20.9): its surface area is finite, but has no 
edge. The scale factor R(t) represents the size of 
the sphere. When R changes, the distances be¬ 
tween points on the surface change in the same 
way. Similarly, a three-dimensional “spherical 
surface”, or the space of elliptical geometry, has 
a finite volume, but no edge. Starting off in an 
arbitrary direction and going on for long enough, 
one always returns to the initial point (unless the 
distance is increased too much by the expansion). 

When k = 0, space is flat or Euclidean, and the 
expression for the line element (20.4) is almost 
the same as in the Minkowski space. The only 
difference is the scale factor R(t). All distances 
in a Euclidean space change with time. The two- 
dimensional analogue of this space is a plane. 

The volume of space in the hyperbolic geom¬ 
etry (A: = — 1) is also infinite. A two-dimensional 
idea of the geometry in this case is given by a sad¬ 
dle surface or a horn becoming infinitely wider. 

In a homogeneous and isotropic universe, 
many physical quantities will depend on time 
through the scale factor R(t). For example, from 
the form of the line element, it is evident that all 
distances will be proportional to R (Fig. 20.10). 



Fig. 20.10 When space expands, all galaxy separations 
grow with the scale factor R\ r' = [R{tg)f R(tg)]r 


Thus, if the distance to a galaxy is r at time t , then 
at time to (in cosmology, the subscript 0 refers to 
the present value) it will be 


R(to ) 

m 


(20.5) 


Similarly, all volumes will be proportional to I? 3 . 
From this it follows that the density of any con¬ 
served quantity (e.g. mass) will behave as /? . 

It can be shown that the wavelength of ra¬ 
diation in an expanding universe is proportional 
to R, like all other lengths. If the wavelength at 
the time of emission, corresponding to the scale 
factor R, is X, then it will be ),q when the scale 
factor has increased to Rq : 


Xq _ *o 
T ~~R' 


( 20 . 6 ) 


The redshift is z = (Xq — X)/X, and hence 


, , Ro 

l+z =Y' 


(20.7) 


i.e. the redshift of a galaxy expresses how much 
the scale factor has changed since the light was 
emitted. For example, the light from a quasar 
with z — 1 was emitted at a time when all dis¬ 
tances were half their present values. 

For small values of the redshift, (20.7) ap¬ 
proaches the usual form of Hubble’s law. This can 
be seen as follows. When z is small, the change 
in R during the propagation of a light signal will 
also be small and proportional to the light travel 
time t. Because t = r/c approximately, where r 
is the distance of the source, the redshift will be 
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proportional to r. If the constant of proportional¬ 
ity is denoted by H/c, one has 

z = Hr/c. (20.8) 

This is formally identical to Hubble’s law (20.1). 
However, the redshift is now interpreted in the 
sense of (20.7). 

As the universe expands, the photons in the 
background radiation will also be redshifted. The 
energy of each photon is inversely proportional to 
its wavelength, and will therefore behave as R~ l . 
It can be shown that the number of photons will 
be conserved, and thus their number density will 
behave as /? -3 . Combining these two results, one 
finds that the energy density of the background 
radiation is proportional to R~ 4 . The energy den¬ 
sity of blackbody radiation is proportional to 7’ 4 , 
where T is the temperature. Thus the temperature 
of cosmic background radiation will vary as R~ l . 


20.4 The Friedmann Models 

The results of the preceding section are valid in 
any homogeneous and isotropic universe. In order 
to determine the precise time-dependence of the 
scale factor R(t) a theory of gravity is required. 

In 1917 Albert Einstein presented a model 
of the Universe based on his general theory of 
relativity. It described a geometrically symmet¬ 
ric (spherical) space with finite volume but no 
boundary. In accordance with the cosmologi¬ 
cal principle, the model was homogeneous and 
isotropic. It was also static : the volume of space 
did not change. 

In order to obtain a static model, Einstein had 
to introduce a new repulsive force, the cosmo¬ 
logical term , in his equations. The size of this 
cosmological term is given by the cosmological 
constant A. Einstein presented his model before 
the redshifts of the galaxies were known, and tak¬ 
ing the Universe to be static was then reasonable. 
When the expansion of the Universe was discov¬ 
ered, this argument in favour of the cosmological 
constant vanished. Einstein himself later called it 
the biggest blunder of his life. Nevertheless, the 
most recent observations now seem to indicate 
that a non-zero cosmological constant has to be 
present. 


The St. Petersburg physicist Alexander Fried¬ 
mann and later, independently, the Belgian Geor¬ 
ges Lemaitre studied the cosmological solutions 
of Einstein’s equations. If A — 0, only evolving, 
expanding or contracting models of the Universe 
are possible. From the Friedmann models exact 
formulas for the redshift and Hubble’s law may 
be derived. 

The general relativistic derivation of the law of 
expansion for the Friedmann models will not be 
given here. It is interesting that the existence of 
three types of models and their law of expansion 
can be derived from purely Newtonian consider¬ 
ations, with results in complete agreement with 
the relativistic treatment. The detailed derivation 
is given on Box 20.1, but the essential character 
of the motion can be obtained from a simple en¬ 
ergy argument. 

Let us consider a small expanding spherical re¬ 
gion in the Universe. In a spherical distribution of 
matter, the gravitational force on a given spher¬ 
ical shell depends only on the mass inside that 
shell. We shall here assume A = 0. 

We can now consider the motion of a galaxy 
of mass m at the edge of our spherical region. 
According to Hubble’s law, its velocity will be 
V = Hr and the corresponding kinetic energy, 

T = mV 2 /2. (20.9) 

The potential energy at the edge of a sphere of 
mass M is U = —GMm/r. Thus the total energy 
is 


E = T + U = m V 2 /l - GMm/r, (20.10) 


which has to be constant. If the mean density of 
the Universe is p, the mass is M = (47rr 3 /3)p. 
The value of p corresponding to E — 0 is called 
the critical density, p c . We have 


E = 


—mH 2 r 2 

2 

- mH 2 r 2 
2 


GMm 

r 

47t r 3 p c 

Gm - 

3 r 


= mr 


2 




( 20 . 11 ) 
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whence 

(2ai2) 

The expansion of the Universe can be com¬ 
pared to the motion of a mass launched vertically 
from the surface of a celestial body. The form of 
the orbit depends on the initial energy. In order 
to compute the complete orbit, the mass M of 
the main body and the initial velocity have to be 
known. In cosmology, the corresponding param¬ 
eters are the mean density and the Hubble con¬ 
stant. 

The E = 0 model corresponds to the Eu¬ 
clidean Friedmann model, the Einstein-de Sitter 
model. If the density exceeds the critical density, 
the expansion of any spherical region will turn 
to a contraction and it will collapse to a point. 
This corresponds to the closed Friedmann model. 
Finally, if p < p c , the ever expanding hyperbolic 
model is obtained. The behaviour of the scale fac¬ 
tor in these three cases is shown in Fig. 20.11 . 

These three models of the universe are called 
the standard models. They are the simplest rel¬ 
ativistic cosmological models for A = 0. Mod¬ 
els with A ^ 0 are mathematically more compli¬ 
cated, but show the same general behaviour. 

The simple Newtonian treatment of the expan¬ 
sion problem is possible because Newtonian me¬ 
chanics is approximately valid in small regions 
of the Universe. However, although the resulting 
equations are formally similar, the interpretation 



Fig. 20.11 The time dependence of the scale factor for 
different values of k. The cosmological constant A = 0 


of the quantities involved (e.g. the parameter k) 
is not the same as in the relativistic context. The 
global geometry of the Friedmann models can 
only be understood within the general theory of 
relativity. 

Consider two points at a separation r at the 
time t. Let their relative velocity be V . If the dis¬ 
tance at to is ro then 


r = 


R(t ) 

- ro 

R(to) 


and 


V = r = 


m 

-rn, 

R(to) 

(20.13) 


and thus the Hubble constant is 



m 

R(t) 


(20.14) 


The deceleration of the expansion is described 
by the deceleration parameter q , defined as 


q = -RR/R 2 . (20.15) 


The deceleration parameter describes the change 
of the rate of the expansion R. The additional fac¬ 
tors have been included in order to make it dimen¬ 
sionless, i.e. independent of the choice of units of 
length and time. 

The value of the deceleration parameter can 
be expressed in terms of the Hubble constant and 
density. Substitute the definition of H and the ex¬ 
pression of R (from Box 20.1) to the definition 
of q. We get 


4jtG Po^o 
3 R 3 H 2 ' 


(20.16) 


The density is usually expressed in terms of the 
density parameter £2 defined as £2 = p/Pc, so 
that £2 = 1 corresponds to the Einstein-de Sitter 
model. Thus 


0 _ 8j rG po^o 


(20.17) 


Hence there is a simple relation between £2 
and q: 

£2 = 2q. (20.18) 

The value q — 1/2 of the deceleration parameter 
corresponds to the critical density £2=1. Both 
quantities are in common use in cosmology. It 
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should be noted that the density and the decelera¬ 
tion can be observed independently. The validity 
of (20.18) is thus a test for the correctness of gen¬ 
eral relativity with A — 0. 


20.5 Cosmological Tests 

A central cosmological problem is the question of 
which Friedmann model best represents the real 
Universe. Different models make different obser¬ 
vational predictions. Recently there has been con¬ 
siderable progress in the determination of the cos¬ 
mological parameters, and for the first time there 
is now a set of parameters that appear capable 
of accounting for all observations. In the follow¬ 
ing, some possible tests will be considered. These 
tests are related to the average properties of the 
Universe. Further cosmological constraints can 
be obtained from the observed structures. These 
will be discussed in Sect. 20.7. 

The Critical Density If the average density p 
is larger than the critical density p c , the Uni¬ 
verse is closed. For the Hubble constant H — 
100 kms _1 Mpc _1 , the value of p c — 1.9 x 
10~ 26 kgm -3 , corresponding to roughly ten hy¬ 
drogen atoms per cubic metre. Mass determina¬ 
tions for individual galaxies lead to smaller val¬ 
ues for the density, favouring an open model. 
However, the density determined in this way 
is a lower limit, since there may be significant 
amounts of invisible mass outside the main parts 
of the galaxies. 

If most of the mass of clusters of galaxies 
is dark and invisible, it will increase the mean 
density nearer the critical value. Using the virial 
masses of X-ray clusters of galaxies (Sect. 19.2), 
one finds f2o = 0.3. Considerations of the ob¬ 
served velocities of clusters of galaxies indicate 
that the relative amount of dark matter does not 
increase further on even larger scales. 

Neutrinos have a small mass (about 10 -4 elec¬ 
tron mass). A large neutrino background should 
have been produced in the big bang. In spite of 
the small suggested mass of the neutrinos, they 
can still form a part of the mass of the Universe. 
Nowadays it is known that the mass of the neu¬ 
trino is too small to explain the dark matter prob¬ 
lem. Also, they move at relativistic speeds and are 


too “hot” to form the observed structures. Instead 
much recent work in cosmology has been based 
on the hypothesis of Cold Dark Matter (CDM), 
i.e. the idea that a significant part of the mass of 
the Universe is in the form of non-relativistic par¬ 
ticles of an unknown kind. 

It is understood that the mean density of the 
University cannot be reliably determined from 
the observations of the current constituents of the 
Universe. Instead, a very precise value for the 
mean density is found by studying the details of 
the cosmic background radiation, since that gives 
a single picture of the contents of the whole Uni¬ 
verse. The measurements show that the density is 
exactly the critical density, which means that the 
space is flat. The most recent result of Planck ob¬ 
tained in 2015 gives a curvature of 0.001 ± 0.004, 
meaning that the deviation from zero is at most 
0.5 %. 

The Magnitude-Redshift Test Although for 
small redshifts, all models predict the Hubble re¬ 
lationship m = 5 lg z + C for standard candles, for 
larger redshifts there are differences, depending 
on the deceleration parameter q. This provides 
a way of measuring q. 

The models predict that galaxies at a given 
redshift look brighter in the closed models than 
in the open ones (see Fig. 20.4). Measurements 
of type la supernovae out to redshifts z = 1.7 us¬ 
ing the Hubble Space Telescope have now shown 
that the observed q is inconsistent with models 
having A — 0. Assuming Q<) = 0.3 these obser¬ 
vations require Q a =0.7, where Qa is defined 
below in Box 20.1. 

The Angular Diameter-Redshift Test Along 
with the magnitude-redshift test, the relation be¬ 
tween angular diameter and redshift has been 
used as a cosmological test. Let us first consider 
how the angular diameter 6 of a standard object 
varies with distance in static models with differ¬ 
ent geometries. In a Euclidean geometry, the an¬ 
gular diameter is inversely proportional to the dis¬ 
tance. In an elliptical geometry, 6 decreases more 
slowly with distance, and even begins to increase 
beyond a certain point. The reason for this can be 
understood by thinking of the surface of a sphere. 
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For an observer at the pole, the angular diameter 
is the angle between two meridians marking the 
edges of his standard object. This angle is small¬ 
est when the object is at the equator, and grows 
towards the opposite pole. In a hyperbolic geom¬ 
etry, the angle 6 decreases more rapidly with dis¬ 
tance than in the Euclidean case. 

In an expanding closed universe the angular 
diameter should begin to increase at a redshift 
of about one. This effect has been looked for in 
the diameters of radio galaxies and quasars. No 
turnover has been observed, but this may also be 
due to evolution of the radio sources or to the 
selection of the observational data. At smaller 
redshifts, the use of the diameters of clusters of 
galaxies has yielded equally inconclusive results. 

Evolutionary effects are the most important 
factors for the uncertainty. We know that the 
sizes of large scale structures are changing while 
the Universe is evolving. The evolution of the 
structures has been investigated using cosmo¬ 
logical simulations (e.g. Millennium, Illustris 
Fig. 20.14). The simulations give estimates for 
the object sizes at different redshift, and these val¬ 
ues can be compared with the actual observations. 
The results are consistent with the cosmological 
ZlCDM concordance model. 

Basically the same idea can be applied to the 
angular scale of the strongest fluctuation in the 
cosmic microwave background. The linear size 
of these depends only weakly on the cosmolog¬ 
ical model and can therefore be treated as a stan¬ 
dard measuring rod. Their redshift is determined 
by the decoupling of matter and radiation (see 
Sect. 20.6). Observations of their angular size 
have provided strong evidence that f2o + Q a = 1 , 
i.e. the Universe is flat. 

Primordial Nucleosynthesis The standard mo¬ 
del predicts that 25 % of the mass of the Universe 
turned into helium in the “big bang”. This amount 
is not sensitive to the density and thus does not 
provide a strong cosmological test. However, the 
amount of deuterium left over from helium syn¬ 
thesis does depend strongly on the density. Al¬ 
most all deuterons formed in the big bang unite 
into helium nuclei. For a larger density the colli¬ 
sions destroying deuterium were more frequent. 


Thus a small present deuterium abundance indi¬ 
cates a high cosmological density. Similar argu¬ 
ments apply to the amounts of 3 He and 7 Li pro¬ 
duced in the big bang. The interpretation of the 
observed abundances is difficult, since they have 
been changed by later nuclear processes. Still, 
present results for the abundances of these nuclei 
are consistent with each other and with a den¬ 
sity corresponding to T?o about 0.04. Note that 
this number only refers to the mass in the form 
of baryons, i.e. protons and neutrons. Since the 
virial masses of clusters of galaxies indicate that 
f2o is about 0.3, this has stimulated models such 
as the CDM model, where most of the mass is not 
in the form of normal baryons. 

Ages The ages of different Friedmann models 
can be compared with known ages of various 
systems. The age to of a Friedmann model with 
given and Q is obtained by integrating equa¬ 
tion (8) of Box 20.1. This gives 

to — Hq 1 f da(Z2oa _1 + £2act + 1 

Jo 

- £2o- n A )~ l/1 . (20.19) 

This age is required to be larger than the ages of 
the oldest known astronomical objects. 

If the density is critical and A = 0, toHo — 
2/3. Thus if Ho — 75 kms -1 Mpc -1 , the age is 
to — 9 Ga. Larger values of Qo give smaller ages, 
whereas positive values of A lead to larger ages. 
It has been a source of embarrassment that the 
best values of H have tended to give an age for 
the Universe only marginally consistent with the 
ages of the oldest astronomical objects. With the 
introduction of a positive cosmological constant 
and a slight downward revision of stellar ages this 
problem has disappeared. In 2003 the WMAP re¬ 
sults gave an age of 13.7 Ga. A more recent value 
by the Planck satellite is 13.80 ± 0.02 Ga. 

The best current parameter values give 13- 
14 Ga for the age of the Universe. 

The “Concordance” Model In summary there 
has been a remarkable recent convergence be¬ 
tween different cosmological tests. The resulting 
model has a positive cosmological constant, and 
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most of the matter is cold and dark. It is thus re¬ 
ferred to as the ACDM model. The best parame¬ 
ter values are H — 70 kms -1 Mpc -1 , Qa = 0.7, 

= 0.3, with cold dark matter making up 85 % 
of the total density. 

The concordance model is by no means defini¬ 
tive. In particular the reason for the cosmological 
constant is a major puzzle. In order to allow for 
the possibility that A is variable it has become 
customary to refer to it as dark energy , saving the 
term cosmological constant for the case of a con¬ 
stant A. Even if some alternative mechanism can 
produce the same effect as a non-zero A, finding 
at least one set of acceptable parameters, thanks 
to the WMAP and Planck satellites, is an impor¬ 
tant step forward. 

An additional set of constraints on the cosmo¬ 
logical model comes from the large-scale struc¬ 
tures of the Universe. These constraints, which 
contribute further support for the concordance 
model, will be considered in Sect. 20.7. 


20.6 History of the Universe 

We have seen how the density of matter and of ra¬ 
diation energy and temperature can be computed 
as functions of the scale factor R. Since the scale 
factor is known as a function of time, it is possi¬ 
ble to calculate how these quantities change with 
time. 

When we back in time R is decreasing. Since 
p oc R~ , T oc R~ l , densities and temperatures 
at the beginning were so immense that all theo¬ 
ries about the physical processes taking place are 
highly conjectural. Nevertheless attempts have 
been made at understanding the most fundamen¬ 
tal properties of the Universe on the basis of 
modern theories of particle physics. For example, 
no indications of significant amounts of antimat¬ 
ter in the Universe have been discovered. Thus, 
for some reason, the number of matter particles 
must have exceeded that of antimatter particles by 
a factor of 1.000000001. Because of this symme¬ 
try breaking, when 99.9999999 % of the hadrons 
were annihilated, 10“ 7 % was left later to form 
galaxies and everything else. It has been specu¬ 
lated that the broken symmetry originated in par¬ 
ticle processes about 10 -35 s after the initial time. 



Fig. 20.12 The energy densities of matter and radiation 
decrease as the Universe expands. Nucleon-antinucleon 
pairs annihilate at 10~ 4 s; electron-positron pairs at 1 s 

The breaking of fundamental symmetries in 
the early Universe may lead to what is known as 
inflation of the Universe. In consequence of sym¬ 
metry breaking, the dominant energy density may 
be the zero-point energy of a quantum field. This 
energy density will lead to inflation, a strongly 
accelerated expansion, which will dilute irregu¬ 
larities and drive the density very close to the 
critical value. One may thus understand how the 
present homogeneity, isotropy and flatness of the 
Universe have come about. In the inflationary 
picture the Universe has to be very nearly flat, 
Qq + = 1. The inflationary models also make 

specific predictions for the form of the irregulari¬ 
ties in the CMB. These predictions are in general 
agreement with what has been observed. 

As the Universe expanded, the density and 
temperature decreased (Fig. 20.12) and condi¬ 
tions became such that known physical princi¬ 
ples can be applied. During the hot early stages, 
photons and massive particles were continually 
changing into each other: high-energy photons 
collided to produce particle-antiparticle pairs, 
which then were annihilated and produced pho¬ 
tons. As the Universe cooled, the photon ener¬ 
gies became too small to maintain this equilib¬ 
rium. There is a threshold temperature below 
which particles of a given type are no longer 
produced. For example, the threshold tempera¬ 
ture for hadrons (protons, neutrons and mesons) 
is T = 10 12 K, reached at the time t = 10 -4 s. 
Thus the present building blocks of the atomic 
nuclei, protons and neutrons, are relics from the 
time 10 _8 -10 -4 s, known as the hadron era. 
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Even before the hadron era, when the age of 
the Universe was about 10“ 12 s, the dark mat¬ 
ter particles forming about 5/6 of all matter were 
born. They were decoupled from radiation around 
1(T 9 s and started to form structures to which the 
ordinary matter started to condense hundred mil¬ 
lion years later. 

The Lepton Era In the time period 10 _4 -1 s, 
the lepton era, the photon energies were large 
enough to produce light particles, such as elec¬ 
tron-positron pairs. Because of matter-antimatter 
symmetry breaking, some of the electrons were 
left over to produce present astronomical bodies. 
During the lepton era neutrino decoupling took 
place. Previously the neutrinos had been kept 
in equilibrium with other particles by fast par¬ 
ticle reactions. As the density and temperature 
decreased, so did the reaction rates, and finally 
they could no longer keep the neutrinos in equi¬ 
librium. The neutrinos decoupled from other mat¬ 
ter and were left to propagate through space with¬ 
out any appreciable interactions. It has been cal¬ 
culated that there are at present 600 such cosmo¬ 
logical neutrinos per cubic centimetre, but their 
negligible interactions make them extremely dif¬ 
ficult to observe. After the neutrino decoupling 
the amount of free neutrons kept decreasing as 
they decayed to protons and electrons. 

The Radiation Era After the end of the lep¬ 
ton era, about 1 s after the initial time, the most 
important form of energy was electromagnetic ra¬ 
diation. This stage is called the radiation era. At 
its beginning the temperature was about 10 10 K 
and at its end, about one million years later, when 
the radiation energy density had dropped to that 
of the particles, it had fallen to about 40,000 de¬ 
grees. At the very beginning of the radiation era 
within a few hundred seconds helium was pro¬ 
duced. 

Just before the epoch of helium synthesis, the 
number ratio of free protons and neutrons was 
changing because of the decay of the free neu¬ 
trons. After about 100 s the temperature had 
dropped to about 10 9 K, which is low enough 
for deuterons to be formed. All remaining neu¬ 
trons were then incorporated in deuterons; these, 


in turn, were almost entirely consumed to pro¬ 
duce helium nuclei. Thus the amount of helium 
synthesised was determined by the number ratio 
of protons and neutrons at the time of deuterium 
production t — 100 s. Calculations show that this 
ratio was about 14 : 2. Thus, out of 16 nucleons, 
2 protons and 2 neutrons were incorporated in 
a helium nucleus. Consequently 4/16 = 25 % of 
the mass turned into helium. This is remarkably 
close to the measured primordial helium abun¬ 
dance. 

Only the isotopes 2 H, 3 He, 4 He and 7 Li were 
produced in appreciable numbers by nuclear pro¬ 
cesses in the big bang. The heavier elements have 
formed later in stellar interiors, in supernova ex¬ 
plosions and perhaps in energetic events in galac¬ 
tic nuclei. 

Radiation Decoupling. The Matter Era As 

we have seen, the mass density of radiation (ob¬ 
tained from the formula E = me 2 ) behaves as 
R~ 4 , whereas that of ordinary matter behaves as 
R~ 3 . Thus the radiation mass density decreases 
more rapidly. At the end of the radiation era it 
became smaller than the ordinary mass density. 
The matter era began, bringing with it the for¬ 
mation of galaxies, stars, planets and human life. 
At present, the mass density of radiation is much 
smaller than that of matter. Therefore the dynam¬ 
ics of the Universe is completely determined by 
the density of massive particles. 

Soon after the end of the radiation era, radia¬ 
tion decoupled from matter. This happened when 
the temperature had dropped to a few thousand 
degrees, and the protons and electrons combined 
to form hydrogen atoms. It was the beginning of 
the “dark ages” at redshifts z = 1000-100, be¬ 
fore stars and galaxies had formed, when the Uni¬ 
verse only contained dark matter, blackbody radi¬ 
ation, and slowly cooling neutral gas. 

At present, light can propagate freely through 
space. The world is transparent to radiation; the 
light from distant galaxies is weakened only by 
the r~ 2 law and by the redshift. Since there is 
no certain detection of absorption by neutral gas, 
there must have been a reionisation of the Uni¬ 
verse. It is thought that this occurred around z = 
5-10. 
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20.7 The Formation of Structure 

As we go backward in time from the present, the 
distances between galaxies and clusters of galax¬ 
ies become smaller. For example, the typical sep¬ 
aration between galaxies is 100 times their di¬ 
ameter. At the redshift z — 101 (R (now) /R(t) = 
100) most galaxies must have been practically in 
contact. For this reason galaxies in their present 
form cannot have existed much earlier than about 
z = 100. Since the stars were presumably formed 
after the galaxies, all present astronomical sys¬ 
tems must have formed later than this. 

It is thought that all observed structures in the 
Universe have arisen by the gravitational collapse 
of small overdensities. Whereas the presently ob¬ 
servable galaxies have undergone considerable 
evolution, which makes it difficult to deduce their 
initial state, on larger scales the density variations 
should still be small and easier to study. These 
are the structures considered in the present sec¬ 
tion. The later evolution of galaxies is discussed 
in Sect. 19.8, and the formation of the Milky Way 
in Sect. 18.5. 

The Statistical Description of Large-Scale 
Structure The departures from strict homo¬ 
geneity in the Universe are random in charac¬ 
ter, and must therefore be described using statis¬ 
tical methods. Perhaps the most straightforward 
way of doing this is to take regions of a given 
size, specified in terms of their mass, and give the 
probability distribution for relative density varia¬ 
tions on that scale. 

A second method is to consider the spatial 
separations between individual objects such as 
galaxies or clusters. The distribution of these sep¬ 
arations is used to define the correlation function, 
which is a measure of the clustering of the objects 
in question. 

A third method to describe large-scale fluctu¬ 
ations is by means of the power spectrum. Here 
the density variations (in space or in projection 
on the sky) are represented as sum of waves. The 
power spectrum is the squared amplitude of these 
waves as a function of wavelength. 

All three methods are representations of the 
density variations in the Universe, and they are 



Fig. 20.13 The motion of the Milky Way in relation to 
the microwave background can be seen in the measure¬ 
ments of WMAP. One side of the sky is darker (colder), 
and the other side is lighter (warmer). The horizontal 
stripe is the densest part of the Milky Way. (Photo NASA) 

theoretically closely related. However, in practice 
they are observed in different ways, and there¬ 
fore, which representation is most suitable de¬ 
pends on what kind of observations are being 
analysed. The density variations are usually de¬ 
scribed by means of a spectral index n and an 
amplitude ag to be introduced below. 

The Growth of Perturbations In order to de¬ 
scribe the growth of structures in the Universe, 
consider a given region containing the mass M. 
If its density is slightly larger than the mean den¬ 
sity, its expansion will be slightly slower than that 
of the rest of the Universe, and its relative over¬ 
density will grow. The rate of growth as a func¬ 
tion of mass depends on the relative importance 
of the material components of the Universe, dark 
matter, radiation, and ordinary baryonic matter. 

It is assumed that there is an initial distribu¬ 
tion of perturbations where the fluctuations with 
mass M have an amplitude that is proportional 
to M _ (" +3) / 6 . The spectral index n is a cosmo¬ 
logical parameter to be determined from observa¬ 
tions. 

The first step in structure formation is when 
a given mass comes within the horizon, i.e. when 
there has been enough time since the big bang 
for light signals to cross the given region. During 
the radiation era the horizon mass grows propor¬ 
tionally to t 3/1 , and thus the time at which the 
mass M comes within the horizon will be propor¬ 
tional to M 2//3 . Any perturbation will initially be 
larger than the horizon mass, and while this is the 
case it grows in proportion to t. Once a matter 
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perturbation comes inside the horizon its ampli¬ 
tude will remain constant. This constant ampli¬ 
tude will behave as /M _ (” +3 )/ 6 , which is propor¬ 
tional to If n = 1, the perturbations 

enter the horizon with an amplitude that is inde¬ 
pendent of mass. We shall see that the observed 
value of n is in fact very close to 1. 

Perturbations of both dark and baryonic mat¬ 
ter density will behave as described above during 
the radiation era. At the end of the radiation era at 
time f k q, when the mass densities of radiation and 
(non-relativistic) matter become equal, the ampli¬ 
tude of the perturbations will be given by the hori¬ 
zon mass Meq ~ 10 16 M Q for M Meq, and 
will be proportional to M~ («+3)/6 for M ^ Meq 
After equality the dark matter perturbations will 
be free to start growing again as t 2 / 3 , independent 
of mass. 

Unlike dark matter, ordinary baryonic matter 
perturbations cannot grow as long as the Universe 
remains ionised. Instead there is a minimum mass 
of a collapsing gas cloud given by the Jeans mass 
My 

p3/2 

( 20 - 2 °) 

where p and P are the density and pressure in the 
cloud (see Sect. 6.11). The value of Mj before 
decoupling was 

Mj = 10 18 M q (20.21) 

and after decoupling 

Mj = 10 5 M o . (20.22) 

The reason for the large difference is that before 
decoupling, matter feels the large radiation pres¬ 
sure (P — m/3, see Box 11.1). After decoupling, 
this pressure no longer affects the gas. 

The large Jeans mass before decoupling means 
that overdense regions of normal gas cannot start 
growing before z = 1000. Rather than growing 
they oscillate like sound waves. After decoupling 
a large range of masses become Jeans unsta¬ 
ble. By then density perturbations of dark matter 
have had time to grow, and the gas will therefore 
rapidly fall into their potential wells. The first 
stars will start forming in the collapsing regions, 
and will reionise the Universe. 


Fig. 20.14 Cosmological 
simulations are used to 
study the evolution of the 
structures of the Universe. 
The international Illustris 
simulation shows how the 
network of dark matter 
evolves from redshift z = 4 
(bottom frame) to z = 0 
(topmostframe). (Illustris 
Collaboration, www. 
illustris-project.org) 



Because the expansion of the Universe works 
against the collapse, the density of Jeans unstable 
regions grows rather slowly. In order to produce 
the observed systems, the density perturbations at 
decoupling cannot be too small. In models with¬ 
out dark matter the variations in the CMB pre¬ 
dicted on this basis tended to be too large. In the 
CDM model the predicted variations are of the 
expected amplitude. 

In the CDM model the amplitude of fluc¬ 
tuations on scales about Meq and smaller de¬ 
pends only weakly on mass. This is why the 
CDM model leads to an hierarchical description 
of structure formation. In this picture, systems of 
all masses above 10 5 Mq begin forming after de¬ 
coupling. Because smaller systems will collapse 
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more rapidly, they are the first to form, at redshifts 
about 20. Once the first sources of light, starbursts 
or AGNs, had formed, they could reionise the gas. 
This marked the ending of the dark ages at red- 
shifts z = 10-5. 

The redshift of reionisation is still not well 
known, and is therefore treated as a parameter to 
be determined in tests based on large-scale struc¬ 
tures. It is usually expressed by means of r, the 
optical depth to electron scattering of the back¬ 
ground radiation. A larger value of r corresponds 
to a higher electron density, implying reionisation 
at a higher redshift. Since galaxies can be seen at 
redshifts larger than 6, the corresponding value of 
r = 0.03 represents a minimum. The final results 
(2012) of the observations with WMAP gave r = 
0.081 ± 0.012, which would have corresponded 
to a redshift 10.1 ± 1.0. The final results (2015) 
of Planck the gave 8.8 ± 0.1 as the mean redshift 
of reionisation. 

One finally has to ask where the initial pertur¬ 
bations came from. An attractive feature of the 
inflationary model is that it makes specific pre¬ 
dictions for these initial perturbations, deriving 
them from quantum effects at very early times. 
In this way the observed properties of the largest 
astronomical systems contain information about 
the earliest stages of our Universe. 

Fluctuations of the Cosmic Microwave Back¬ 
ground One important way of studying the 
large-scale structure of the early Universe is by 
means of the irregularities of the cosmic mi¬ 
crowave background. The overdensities that were 
later to give rise to observed structures should 
also give rise to temperature variations of the 
CMB. 

The temperature variations in the microwave 
background have been mapped, first by the COBE 
satellite, then by WMAP (Fig. 20.13) and Planck. 
A map of the CMB according to Planck is shown 
in Fig. 20.15. The observed variations are in qual¬ 
itative agreement with the scenario for structure 
formation described above. 

A more quantitative view of the observations 
is provided by the power spectrum of the obser¬ 
vations shown in Fig. 20.16. This shows the am¬ 
plitude of the temperature variations as a function 
of angular scale on the sky. 



-500 500 iM (CMB) 


Fig. 20.15 Temperature distribution of the background 
radiation according to the measurements of the Planck 
satellite. Dark areas are colder, light areas wanner and 
more dense. (Planck Collaboration) 

The physical processes we have described give 
rise to the features in the power spectrum. Thus 
the first peak is produced by perturbations that 
have just had time to collapse to a maximum den¬ 
sity since equality of matter and radiation, be¬ 
fore bouncing back. The linear size of this peak 
does not depend strongly on the exact parame¬ 
ters of the model, and its position (angular scale) 
can therefore be used as a standard measuring rod 
in the diameter—redshift test. The second and 
third peaks in the power spectrum, the “acous¬ 
tic peaks”, correspond to perturbations that have 
bounced back and collapsed again, respectively. 
The positions and amplitudes of these features 
in the power spectrum depend on the cosmolog¬ 
ical parameters. Thus the WMAP observations 
of the CMB can be fitted using a model con¬ 
taining six free parameters: the Hubble constant, 
the densities of dark matter and baryons, the op¬ 
tical depth r, and the amplitude and shape 
n of the initial perturbations. The curvature of 
the Universe, 1 — f2o — Q ,\ can be set equal to 
zero, which determines the value of ■ The re¬ 
sults of WMAP and Planck have given values for 
these parameters in full agreement with the con¬ 
cordance model. 

Very Large Scale Structure After decoupling 
structures on different scales are free to grow. The 
important dividing line is whether a given over¬ 
density is still expanding with the rest of the Uni¬ 
verse or whether it has had time to recollapse. 
Systems that have recollapsed will fairly rapidly 
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Fig. 20.16 Angular power 
spectrum of the 
temperature variations of 
the cosmic microwave 
background from the 
results of the Planck 
satellite. This gives the 
square of the amplitude of 
the temperature variations 
on different angular scales. 
(Planck Collaboration) 



virialise, i.e. settle down into a stationary state. 
Such systems can be considered real astronomi¬ 
cal systems. 

The largest astronomical systems are clusters 
of galaxies. For some time there was controversy 
about whether even larger structures, superclus¬ 
ters, existed. The controversy was settled in the 
1970’s when it was realised that this largely de¬ 
pended on what was meant by superclusters. Re¬ 
gions of higher density existed on scales larger 
than clusters of galaxies (a few Mpc), all the way 
up to 100 Mpc, but they did not really form indi¬ 
vidual structures in approximate equilibrium like 
the clusters of galaxies. 

Between these two alternatives are structures 
that are only now turning round and beginning to 
recollapse. One way of specifying the amplitude 
of the initial fluctuations is to give the scale of this 
transition point. This scale corresponds roughly 
to a present linear size of 8 Mpc, and for this rea¬ 
son the fluctuation amplitude is commonly given 
by means of ag, the amplitude of the fluctuations 
at present scale 8 Mpc. Because of the way it has 
been chosen erg should be close to 1. 

Smaller systems will already have collapsed. 
There are many ways of statistically describing 
their large-scale distribution. One of the most 
studied descriptors of clustering is the corre¬ 
lation function , f(r). Consider two infinitesi¬ 
mal volumes dVi and dV 2 separated by the dis¬ 
tance r. If there were no clustering, the proba¬ 
bility of finding galaxies in these volumes would 


be A' 2 dF] dV 2 - where N is the number density 
of galaxies. Because of clustering this probability 
is actually A' 2 (I + £(r)) dVi dV 2 - The correlation 
function thus measures the higher probability of 
finding galaxies near each other. 

Although there are ways of estimating the cor¬ 
relation function from the distribution on the sky, 
a more reliable estimate can be made by mapping 
the distribution of galaxies in three dimensions, 
using redshifts to determine their distances. The 
distribution from one such survey (SDSS, Sloan 
Digital Sky Survey) is shown in Fig. 20.17. 

As expected, the general form of the correla¬ 
tion function is a power law (close pairs are abun¬ 
dant, distant pairs rarer). It has been known since 
the 1970’s that the correlation function is approx¬ 
imately proportional to r y . The constant of pro¬ 
portionality is related to the amplitude eg and the 
exponent y is related to the index n of the initial 
perturbations. The observed value of y is about 
1.8, depending to some extent on the sample of 
objects studied. 

At large separations the curve shows an un¬ 
expected maximum, observed for the first time 
in 2005. This is the same highest acoustic peak 
seen in the angular power spectrum of Planck. 
The observation confirms the ACDM cosmology 
and shows that the acoustic waves born around 
Z = 1000 have remained till the present time and 
are seen in the distribution of galaxies. The same 
peak has been seen later in the larger SDSS sur¬ 
vey. 
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Fig. 20.17 Correlation function for the distribution of 
47,000 bright red SDSS galaxies. The inset shows the 
found “bump” in more detail. The value of h is the Hubble 
constant divided by 100, or about 0.7. Thus a difference 
1 OOh ~ 1 corresponds to a distance of about 150 Mpc (about 
6 degrees on the sky). The bump tells about acoustic os¬ 
cillations of baryons, i.e. large scale acoustic waves of the 
visible matter. (Eisenstein et al„ 2005, ApJ 633, 560) 

In order to determine the cosmological param¬ 
eters the (three-dimensional) power spectrum of 
the galaxy distribution is usually used. It is then 
compared to theoretical power spectra that de¬ 
pend on the parameters in order to find the best¬ 
fitting model. The optical depth parameter r does 
not affect these tests. The results have been com¬ 
pletely consistent with the concordance model. 

The Concordance Model Again We have dis¬ 
cussed the use of the cosmic microwave back¬ 
ground and the large-scale distribution of galax¬ 
ies in order to determine the cosmological param¬ 
eters. Both methods independently give consis¬ 
tent results. Combining them produces even more 
(formally) accurate values. Furthermore, these 
values are in agreement with the ones obtained 
from the traditional cosmological tests discussed 
in Sect. 20.5. 

There are eight basic parameters describing 
the concordance model. According the final re¬ 
sults of Planck (2015) they are: 

Hubble constant Ho 6.7 ± 0.5 kms -1 Mpc -1 , 

baryon density parameter £2\, 0.049 ± 0.001, 


mass density parameter Qq 0.259 ± 0.006, 
redshift of reionisation z. n 8.8 ± 1.1, 
spectral index n 0.967 ± 0.004, 
fluctuation amplitude erg 0.82 ± 0.01. 

Other, less comprehensive tests, such as the 
gravitational lensing by cosmic structures, the 
velocities induced by mass concentrations, and 
the number of clusters of galaxies, have added 
support to the concordance model. This remark¬ 
able agreement between many independent deter¬ 
minations of the cosmological parameters have 
earned the model its name. Once the parameters 
necessary for a general description of the Uni¬ 
verse are reasonably well known, other compo¬ 
nents, such as gravitational waves or neutrinos, 
can be included in the cosmological model. This 
new development has been called the era of pre¬ 
cision cosmology. However, it should still not be 
forgotten that in the model one has to assume the 
presence of both dark matter and dark energy, nei¬ 
ther of which is based on any evidence apart from 
their role in cosmology. 


20.8 The Future of the Universe 

The standard models allow two alternative pros¬ 
pects for the future development of the Uni¬ 
verse. Expansion may either go on forever, or it 
may reverse to a contraction, where everything 
is eventually squeezed back to a point. In the 
final squeeze, the early history of the Universe 
would be repeated backwards: in turn, galaxies, 
stars, atoms and nucleons would be broken up. 
Finally the state of the Universe could no longer 
be treated by present-day physics. It is not known 
whether the collapse would continue to a single 
point or would the Universe avoid the singularity 
and start a new expansion. 

However, according to the current observa¬ 
tions, this is not the fate of the Universe. Instead, 
the expansion will continue forever, even at an 
accelerating pace. 

In the open models the future is quite differ¬ 
ent. The evolution of the stars may lead to one of 
four end results: a white dwarf, a neutron star or 
a black hole may be formed, or the star may be 
completely disrupted. After about 10 11 years, all 
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present stars will have used up their nuclear fuel 
and reached one of these four final states. 

Some of the stars will be ejected from their 
galaxies; others will form a dense cluster at the 
centre. In about 10 27 years the central star clus¬ 
ters will become so dense that a black hole is 
formed. Similarly the galaxies in large clusters 
will collide and form very massive black holes. 

Not even black holes last forever. By a quan¬ 
tum mechanical tunnelling process, mass can 
cross the event horizon and escape to infinity— 
the black hole is said to “evaporate”. The rate 
of this phenomenon, known as the Hawking pro¬ 
cess, is inversely proportional to the mass of the 
hole. For a galactic-mass black hole, the evapo¬ 
ration time is roughly 10 98 years. After this time, 
almost all black holes will have disappeared. 

The ever expanding space now contains black 
dwarfs, neutron stars and planet-size bodies (un¬ 
less the predictions of a finite proton lifetime of 
about 10 31 years are confirmed; in that case all 
these systems will have been destroyed by pro¬ 
ton decay). The temperature of the cosmic back¬ 
ground radiation will have dropped to 10 -2 ° K. 

Even further in the future, other quantum phe¬ 
nomena come into play. By a tunnelling pro¬ 
cess, black dwarfs can change into neutron stars 
and these, in turn, into black holes. In this way, 
all stars are turned into black holes, which will 
then evaporate. The time required has been esti¬ 
mated to be 10 10 years! At the end, only radi¬ 
ation cooling towards absolute zero will remain 
(Fig. 20.18). 



Fig. 20.18 According to current results the future of the 
Universe looks like this. After a very, very long time all 
matter will have changed to radiation. The wavelength of 
the radiation will increase infinitely in an ever expanding 
and cooling space 


It is of course highly doubtful whether our 
current cosmological theories really are secure 
enough to allow such far-reaching predictions. 
New theories and observations may completely 
change our present cosmological ideas. 


Box 20.1 (Newtonian Derivation of a Differ¬ 
ential Equation for the Scale Factor R(t)) Let 
us consider a galaxy at the edge of a massive 
sphere (see figure). It will be affected by a cen¬ 
tral force due to gravity and the cosmological 
force 


AttG r 3 pm 1 

mr = -=-1— mAr, 

3 r 2 3 


or 


4t r 1 

r — - Gpr + - Ar. 

3 3 


( 1 ) 




In these equations, the radius r and the den¬ 
sity p are changing with time. They may be ex¬ 
pressed in terms of the scale factor R: 

r = (R/R 0 )r 0 , (2) 

where R is defined to be Rq when the ra¬ 
dius r — ro. 


p = (R 0 /R) 3 po, (3) 


where the density p = po when R = Ro. Intro¬ 
ducing (2) and (3) in (1), one obtains 


R = - 


a 

Rl 


+ \a r . 


(4) 


where a = 4 ttGRqPo/3. If (4) is multiplied on 
both sides by R , the left-hand side yields 


. •• 1 d (R 2 ) 

RR= —-, 

2 dr 
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and thus (4) takes the form 

d{R 2 ) = -^dR+^ARdR. ( 5 ) 

Let us define Rq = R(1q). Integrating (5) from 
to to t gives 

R 2 — Rq —2a( -—^ 

0 \R RoJ 

+ 1 -A(R 2 -R 2 ). (6) 

The constants Rq and a can be eliminated in 
favour of the Hubble constant Hq and the den¬ 
sity parameter L?o- Because p c = 3//^/8 jtG, 

2 a = 87tG/?qPo/3 

= H 2 R 3 q Po / Pc = H 2 R 3 C2q, (7) 


where £2q = Pol Pc- Using expression (6) 
and R 0 = HqRq in (7), and defining P2a = 
A/(3Hq), one obtains 


R 2 Rq 

°¥ 




1 — ■ 


RoJ 

G/t 


( 8 ) 


as the basic differential equation govern¬ 
ing R(t). 

For simplicity we now set £2a — 0. Then the 
time behaviour of the scale factor R depends on 
the value of the density parameter £2q. Because 
R 2 > 0 always, according to (8) 




Rq 

R 


P2q -|- 1 ^ 0, 


or 

Rq > £2q — 1 
R £2o 

If £2 0 > 1, this means that 

£2q 

R S Ro~ 7 — ^max- 

L2q ~ I 


(9) 


When the scale factor reaches its maximum 
value R max . then according to (8), R = 0, and 
the expansion turns into contraction. If P2q < 1, 


the right-hand side of (8) is always positive and 
the expansion continues forever. 

The equation for the time dependence of the 
scale factor in the general theory of relativity 
contains the constant k which determines the 
geometry of space: 

8*GSj«, 

3 R 

Equations (10) and (6) (or (8)) can be made 
identical if one chooses 


H 2 R 2 (C2 0 - l) = kc 2 . 


Thus, complete agreement between the New¬ 
tonian and the relativistic equation for R is 
obtained. The values of the geometrical con¬ 
stant k = +1, 0, — 1 correspond respectively to 
£2q > 1, = 1 and < 1. More generally, the con¬ 
dition for a flat model, k = 0, corresponds to 
^0 + & A — 1- 

When k — 0, the time dependence of the ex¬ 
pansion is very simple. Setting £2q — 1 and us¬ 
ing (10) and (7), one obtains 


j l 2_H64 

R 


The solution of this equation is 


/ 3// 0 f \ 2/3 

R = {-r) 


(id 


It is also easy to calculate the time from the 
beginning of the expansion: R = Rq at the time 


to '■ 


2 1 
3 Hq 


This is the age of the Universe in the Einstein- 
de Sitter model. 


Box 20.2 (Three Redshifts) The redshift of 
a distant galaxy is the result of three different 
mechanisms acting together. The first one is the 
peculiar velocity of the observer with respect 
to the mean expansion: the Earth moves about 
the Sun, the Sun about the centre of the Milky 
Way, and the Milky Way and the Local Group 
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of galaxies is falling towards the Virgo Cluster. 
The apparatus measuring the light from a dis¬ 
tant galaxy is not at rest; the velocity of the 
instrument gives rise to a Doppler shift that 
has to be corrected for. Usually the velocities 
are much smaller than the speed of light. The 
Doppler shift is then 


zd = v/c. (1) 

For large velocities the relativistic formula has 
to be used: 


ZD = 



( 2 ) 


The redshift appearing in Hubble’s law is the 
cosmological redshift z c - It only depends on the 
values of the scale factor at the times of emis¬ 
sion and detection of the radiation (R and Rq) 
according to 


z c = R 0 /R-l. (3) 


The third type of redshift is the gravitational 
redshift z.g. According to general relativity, 
light will be redshifted by a gravitational held. 
For example, the redshift of radiation from the 
surface of a star of radius R and mass M will 
be 


1 


(4) 


where R$ — 2GM/c 2 is the Schwarzschild ra¬ 
dius of the star. The gravitational redshift of the 
radiation from galaxies is normally insignifi¬ 
cant. 

The combined effect of the redshifts can be 
calculated as follows. If the rest wavelength A.o 
is redshifted by the amounts zi and Z 2 by two 
different processes, so that 


A.2 — Aq 



^2 _ ^ _ A-2 At 

lo A.i Ao 


or 


(1 + Z ) = ( 1 + Z 1 )( 1 + Z2 ). 


Similarly, the three redshifts zu , Zc and Zg will 
combine to give an observed redshift z, accord¬ 
ing to 

1 + Z = (l + ZD)(1 + Z c )(l + Zg). (5) 


20.9 Examples 

Example 20.1 (a) In a forest there are /? trees 
per hectare, evenly spaced. The thickness of each 
trunk is D. What is the distance of the wood not 
seen for the trees? (Find the probability that the 
line of sight will hit a trunk within a distance x .) 
(b) How is this related to the Olbers paradox? 

(a) Imagine a circle with radius x around the ob¬ 
server. A fraction six), 0 < s(x) < 1, is cov¬ 
ered by trees. Then we’ll move a distance d.v 
outward, and draw another circle. There are 
2jrnxdx trees growing in the annulus lim¬ 
ited by these two circles. They hide a dis¬ 
tance 2tx xnDdx or a fraction nDdx of the 
perimeter of the circle. Since a fraction s (x ) 
was already hidden, the contribution is only 
(1 — s(x))nD d.v. We get 

s(x + dx) = s(x) + (l — s(x))nD dx, 
which gives a differential equation for .y: 

This is a separable equation, which can be in¬ 
tegrated: 

f s d.y f x 

/ -= / nD d.v. 

io Jo 

This yields the solution 

s(x)=l-e~ nDx . 

This is the probability that in a random direc¬ 
tion we can see at most to a distance x. This 
function s is a cumulative probability distri¬ 
bution. The corresponding probability den¬ 
sity is its derivative d.v/dx. The mean free 
path X is the expectation of this distribution: 
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For example, if there are 2000 trees per 
hectare, and each trunk is 10 cm thick, we 
can see to a distance of 50 m, on the average, 
(b) The result can easily be generalised into three 
dimensions. Assume there are n stars per unit 
volume, and each has a diameter D and sur¬ 
face A — uD 2 perpendicular to the line of 
sight. Then we have 

s(x) = l-e~ nAx , 

where 

X = 1 In A. 

For example, if there were one sun per cubic 
parsec, the mean free path would be 1.6 x 
10 4 parsecs. If the universe were infinitely 
old and infinite in size, the line of sight would 
eventually meet a stellar surface in any direc¬ 
tion, although we could see very far indeed. 

Example 20.2 Find the photon density of the 
2.7 K background radiation. 

The intensity of the radiation is 

_ 2 hv 3 1 

Bv ~ c 2 e hv/(kT) _ i 

and the energy density 

Atc Snliv 3 1 
Uv = ~ Bv = c 3 e hv/(kT) _ ] ' 

The number of photons per unit volume is found 
by dividing the energy density by the energy of 
a single photon, and integrating over all frequen¬ 
cies: 

f°° u v dv 87r f°° 2 dv 

N = ] 0 = ” efiv/ikT) _ j • 


We substitute hv/kT =x and dv = (kT/h.) dx: 



The integral cannot be expressed in terms of ele¬ 
mentary functions (however, it can be expressed 
as an infinite sum 2£]^l 0 (l/n 3 )), but it can be 
evaluated numerically. Its value is 2.4041. Thus 
the photon density at 2.7 K is 

N 

1.3805 x 10 -23 JK -1 x 2.7 K \ 3 
6.6256 x 10“ 34 Js x 2.9979 x 10 8 ms” 1 / 
x 1.20206 

= 3.99 x 10 8 in -3 % 400 cm -3 . 


20.10 Exercises 

Exercise 20.1 The apparent diameter of the gal¬ 
axy NGC3159 is 1.3', apparent magnitude 14.4, 
and radial velocity with respect to the Milky Way 
6940 km s~ 1 . Find the distance, diameter and ab¬ 
solute magnitude of the galaxy. What potential 
sources of error can you think of? 

Exercise 20.2 The radial velocity of NGC 772 
is 2562 kms -1 . Compute the distance obtained 
from this information and compare the result with 
Exercise 18.1. 

Exercise 20.3 If the neutrinos have nonzero 
mass, the universe can be closed. What is the min¬ 
imum mass needed for this? Assume that A — 0, 
the density of neutrinos is 600 cm -3 , and the den¬ 
sity of other matter is one tenth of the critical 
density. 
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Is there life elsewhere in the universe? And if 
there is, are there any intelligent beings? How 
did life emerge on the Earth, and, as a matter 
of fact, how do we define life and intelligence? 
These are probably the most interesting unsolved 
questions in science. During the last few decades 
a whole new field of astrobiology has evolved 
around these problems. 


21.1 What Is Life? 

There have been several science fiction tv-series 
showing extraterrestrial life forms. Usually the 
sentient beings have appeared quite humanlike, 
except of some appendices or other rather trivial 
features that try to make them unattractive. They 
may even have sex with humans. However, if the 
foreign life forms have evolved independently, in¬ 
terbreeding would be totally impossible. 

The chemistry of foreign beings could be to¬ 
tally different from ours. Could we even under¬ 
stand that they are living beings? In fact, what is 
life? It seems that life is an elusive concept, dif¬ 
ficult to define in terms of just a few properties. 
We have only one example of life, and therefore 
it is difficult to make general conclusions of its 
properties. However, we can assume that certain 
properties of the known life forms may be gener¬ 
alised also to foreign life. 

Common features of all terrestrial life forms 
are reproduction and evolution. If living be¬ 
ings produced exact replicas of themselves, there 
would be no evolution and no adaptation to 
changing environments. Thus the reproduction 


process must be slightly imperfect leading to 
a variety of descendants. This will give material 
for the natural selection, ‘survival of the fittest’. 

Natural selection is a fairly general principle, 
working in some sense also outside of biology. 
Are there any other general principles related to 
the evolution of life? If we could find even one ex¬ 
ample of life that evolved independently of ours, 
this would vastly improve our knowledge. 

Energy consumption is also characteristic of 
life. Life requires increasing order, i.e. decreasing 
entropy. Local decrease of entropy is not against 
thermodynamics: it only means that a living be¬ 
ing must be able to take energy in some form 
and utilise it for reproduction, growth, motion or 
other purposes. 

To produce similar offsprings a living being 
must have the ability to store information and 
pass it to its descendants. All terrestrial life forms 
use DNA or RNA molecules composed of nu¬ 
cleotides for storing information (see next sec¬ 
tion). 

Carbon can combine to form very complex 
molecules. Silicon can also form large molecules 
but they are not as stable as carbon compounds, 
and silicon cannot form rings like carbon. Maybe 
some simple life forms could be based on silicon, 
or on something quite different that we have not 
even thought of. 

Also a liquid solvent is needed. Our own life 
would not be possible without water. It remains 
liquid in a much wider temperature range than 
most other substances, which makes it a good sol¬ 
vent. Yet in the astronomical sense the tempera- 
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ture range is rather limited. In a colder environ¬ 
ment, methane or ammonium might act as the sol¬ 
vent. 

The basic building block of all terrestrial life 
forms is the cell. It has a membrane surrounding 
liquid cytoplasm. The cell membrane is semiper- 
meable and functions as a two-way filter that lets 
certain molecules go in and others come out; this 
selective transport is mediated via specific pro¬ 
teinaceous channels. There are two kinds of cells, 
simpler prokaryotic cells and more complex eu¬ 
karyotic cells. In eukaryotic cells the genetic ma¬ 
terial, in the form of DNA molecules, is inside 
a nucleus, surrounded by a nuclear membrane. In 
the prokaryotic cells there is no separate nucleus, 
and the DNA floats coiled in the cytoplasm. 

Terrestrial life is divided into three domains, 
Bacteria , Archaea and Eukarya. Both Bacteria 
and Archaea contain usually a single prokaryotic 
cell. Eukarya contains all more complex beings, 
like animals and plants. 

According to this scheme viruses are not 
alive although they have certain properties com¬ 
mon to living beings. There are also some other 
molecules, such as viroids and prions, that are not 
classified as living; yet they are not quite inani¬ 
mate. 

If even the definition of terrestrial life leads to 
such borderline cases, a more general definition 
of all possible kinds of life is truly challenging. 
Facing this problem we have to restrict our dis¬ 
cussion to life that, at least to some extent, re¬ 
sembles our own. 


21.2 Chemistry of Life 

The set of really important elements is relatively 
small; it includes hydrogen (H), oxygen (O), ni¬ 
trogen (N), carbon (C), sulphur (S) and phosphor 
(P). The heavier elements can be remembered by 
the mnemonic SPONC. 

The importance of carbon is in its ability to 
make lots of different very complex molecules, 
which are essential for life. There are three ba¬ 
sic types of molecules that function as common 
building blocks to all life: lipids for membranes, 
nucleotides and amino acids. 


Amino acids consist of three different compo¬ 
nents, a carboxyl (COOH), an amine part (NH 2 ) 
and a side chain, which can be just a single hydro¬ 
gen or a more complex structure. Altogether there 
are dozens of different amino acids, but only 20 
of them are used in genetically coded proteins. 

Amino acids can join to form more com¬ 
plex molecules, proteins. Typically, up to sev¬ 
eral hundred amino acids are needed for a pro¬ 
tein molecule. Proteins have numerous functions: 
they support structures, they act as catalysts in 
nearly all biological reactions, in which case they 
are called enzymes , they carry messages as hor¬ 
mones, and so on. 

Nucleotides are basic building blocks of the 
genetic material, DNA and RNA. A nucleotide 
has three components, a sugar, a phosphate group 
and a base. The phosphate parts are always the 
same. All DNA molecules have the same sugar 
part; also all RNA molecules have the same sugar 
part, which, however, differs from the sugar of the 
DNA molecules by having one additional oxygen 
atom. The base can be one of five different types, 
adenine (A), guanine (G), cytosine (C), thymine 
(T), or uracil (U). DNA molecules contain bases 
A, G, C and T, and RNA molecules A, G, C 
and U. 

Nucleotides join to form long chains. The de¬ 
oxyribonucleic acid or DNA (Fig. 21.1) consists 
of two such chains that are bound together as two 
intertwined helices. The corresponding bases join 
together by hydrogen bonds. The bases come al¬ 
ways in matching pairs, AT, TA, CG, or GC. 

A helix is like a screw that can be either left- or 
righthanded. All terrestrial DNA molecules have 
the same handedness or chirality: they are all of 
L-type, or lefthanded. The reason for the asym¬ 
metry is not quite understood. 

The DNA molecule contains information on 
how to make proteins. Three consecutive base 
pairs form a code, called a codon , that speci¬ 
fies one amino acid. Usually thousands of such 
triplets together contain instructions for building 
a protein. 

The basic units of heredity are called genes. 
They are regions of DNA whose final products 
are either proteins or RNA molecules. 

Human cells contain some 25,000 genes, and 
the DNA consists of 3 x 10 9 base pairs. In plants 
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Fig. 21.1 (a) The DNA molecule consists of two strands 
of alternating phosphate groups and sugars connected 
by matching base pairs. When the information stored in 
a fragment of the DNA is needed, the strands separate and 
the bases are used to produce an RNA molecule that will 
contain the same information, (b) The strands coil to form 
a double helix. (Webb: Where is everybody?) 

the numbers can be much higher, but the simplest 
known bacterial genomes have only a few hun¬ 
dred genes. It has been estimated that the mini¬ 
mum number of genes needed for a living being 
is about 200-300. 

Actually, only a part of the DNA contains ge¬ 
netic information. The rest is called junk-DNA 
since it has no known function. The fraction of 



Fig. 21.1 (Continued) 

the junk-DNA is highly variable; in some bacte¬ 
ria the amount is very small. 

The DNA is the storage of the genetic code, 
but the code is not functional directly from the 
DNA. Instead a complex molecular translation 
machinery is needed to execute the instructions. 

First, the instructions encoded in the genetic 
sequence are copied (or transcribed) into another 
type of nucleic acid. This ribonucleic acid or 
RNA resembles the DNA but has one more oxy¬ 
gen atom in its sugar. Different RNA molecules 
serve several roles in the translation process. 
Messenger RNA (mRNA) carries the information 
in the DNA to an organ called ribosome, com¬ 
posed of several RNA’s and multiple proteins. 
There another RNA, the transfer RNA (tRNA), 
brings amino acids into the reaction site, and 
still another RNA molecule, or one of the ribo- 
somal RNA’s (rRNA) forms the linkages (peptide 
bonds) between the adjacent amino acids. 

RNA molecules have to carry instructions for 
making just one protein, and thus they are much 
shorter than DNA molecules. Still they contain 
similar information, and in some simple life 
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forms like viruses they can act as the storage of 
the genetic code. 


21.3 Prerequisites of Life 

Just after the big bang there were hardly any other 
elements than hydrogen and helium. Heavier ele¬ 
ments are needed, both for life itself and for solid 
planets on which life can evolve. Thus at least 
some of the earliest stars had to explode as su¬ 
pernovas and eject their fusion products to the in¬ 
terstellar space. 

Most stars in elliptic galaxies and globular 
clusters are very old and have a low metallicity. 
Therefore they are not very probable locations for 
life. The most suitable places for life seem to be 
the disk populations of spiral galaxies, contain¬ 
ing young stars with high abundances of heavier 
elements. 

Not all places of the galactic disk are equally 
profitable. Far out from the galactic centre the 
star birth rate and consequently the metallicity is 
low. Close to the centre, metallicity is high but 
the environment is rather hostile. Star density is 
very high, and therefore radiation is intense, and 
nearby stars disturb planetary orbits. A rough es¬ 
timate is that only about 20 % of the stars of 
the Milky Way or a similar galaxy lie within the 
galactic habitable zone. 

Assume that a star is in the galactic habitable 
zone. The star has its own habitable zone where 
habitable planets can exist. This zone is usually 
defined as the region where the temperature is be¬ 
tween the freezing and boiling points of water. 
For a fast-rotating perfect blackbody (A = 0 in 
Eq. (7.51)) orbiting the Sun this would mean that 
its distance should be in the range 0.56-1.04 au. 
Planets, however, are not blackbodies, and the 
real situation is more complicated. If a planet has 
a high albedo, it reflects away most of the inci¬ 
dent radiation and its temperature will be much 
lower. But depending on the chemical composi¬ 
tion of the atmosphere and possible clouds the 
greenhouse effect may increase the temperature 
considerably. Many gases are transparent to visi¬ 
ble light, allowing it to heat the surface. Most of 
the energy is emitted back in the infrared region, 


which is effectively absorbed by these green¬ 
house gases, such as water vapour, carbon diox¬ 
ide and methane, and thus remains trapped in the 
atmosphere. 

If the star is cool, the habitable zone is very 
narrow. Hot stars have wider habitable zones, but 
their main sequence phase is short, giving little 
time for life to evolve. Thus main sequence stars 
not too different from the Sun are usually con¬ 
sidered the best candidates for having habitable 
planets. 

During the main sequence phase a star will be¬ 
come a little brighter, which will push the hab¬ 
itable zone slightly outwards. Thus the region 
where the temperature remains suitable for a long 
period of time is narrower than the habitable zone 
at any given moment. The continuously habitable 
zone can be defined as the region that remains 
habitable for a time that is comparable to the main 
sequence phase of the star. For the Sun the es¬ 
timates of the width of this region vary at least 
from 0.06 au up to 0.2 au. The problem is how to 
model albedo and greenhouse effects over a very 
long period of time. 

Binary stars are very common, but at least ear¬ 
lier it was thought that they could not have habit¬ 
able planets, since planetary orbits would be com¬ 
plicated or unstable. However, there are two kinds 
of possible orbits that might be suitable. If the 
components of a binary are far away from each 
other, each component could have a planetary 
system of its own. Or, if it’s a close binary, there 
can be distant planets orbiting the whole binary 
system. 


21.4 Hazards 

Even if life could emerge, there are many hazards 
that may wipe it out. By looking at the Moon we 
can see that meteor bombardment was very in¬ 
tense in the young solar system. Collision of a big 
asteroid or comet could be fatal, the immediate 
devastation caused by the explosion being only 
one of the consequences. The collision would 
eject a lot of dust to the atmosphere cooling the 
climate for several years. The mass extinction 65 
million years ago seems to have been caused by 
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Fig. 21.2 In the young solar system small objects col¬ 
lided continuously to larger planets, threatening seriously 
the evolution of life. Later collisions and perturbations by 
planets have cleaned away most of the potentially danger¬ 
ous objects. Still collisions take place even nowadays. In 
1994 comet Shoemaker-Levy collided to lupiter after dis¬ 
integrating into several parts. Traces of the collisions of 
the fragments were seen as dark spots in the atmosphere of 
Jupiter. In June 20, 1908, an explosion happened in Tun- 
guska, Siberia, hewing down trees in a couple of thousand 
square kilometres. The explosion was possibly caused by 
a comet with a diameter of about hundred metres 


such an event. As the comet Shoemaker-Levy hit¬ 
ting Jupiter (Fig. 21.2) in 1994 showed, such col¬ 
lisions are still possible. Fortunately they are not 
very frequent any more. 

Almost all of the currently known exoplanets 
are Jupiter-like giants. They seem to be neces¬ 


sary for habitable planets, because their pertur¬ 
bations clean the young solar system from debris 
by ejecting it outside the planetary system. How¬ 
ever, many of the known giant planets move on 
highly eccentric orbits, and they may disturb also 
the orbits of earthlike planets. Thus it is further 
required that the giant planets should be on nearly 
circular orbits and not too close to the star. 

Also smaller planets have participated in clear¬ 
ing the regions around their orbits, which is re¬ 
flected in the new definition of a planet (Sect. 7.1). 

Seasons depend on the obliquity of the rota¬ 
tion axis and the eccentricity of the orbit. High 
values will lead to strong seasonal temperature 
variations. In the case of the Earth the Moon 
seems to have a stabilising effect; without the 
Moon the tilt of the axis would have varied much 
more, possibly causing more severe ice ages fatal 
to life. Hence also a relatively big moon seems to 
be in the shopping list of a habitable planet. How¬ 
ever, recently there have been some objections to 
this requirement. 

We have only recently started to understand 
the rather delicate balance and complex feedback 
effects working in the atmosphere. Currently the 
climate is warming due to the increasing green¬ 
house effect, but the Earth has experienced also 
quite opposite phases. If the albedo increases, the 
amount of energy reaching the surface decreases, 
glaciers and snow cover expand, and the amount 
of clouds increases till most of the atmospheric 
humidity is solidified as snow and ice. All this 
will increase the albedo further, speeding up this 
icehouse effect. There is geological evidence of 
global glaciation periods 750-580 million years 
ago and possibly also 2.3 billion years ago. The 
Snowball Earth hypothesis assumes that the cli¬ 
mate cooled down for millions of years and the 
whole surface was covered by a layer of ice at 
least one kilometer thick. During the long cold 
period most of the living organisms, all of which 
at that time lived in water, became extinct. Vol¬ 
canic activity was still going on, adding more car¬ 
bon dioxide to the atmosphere. Finally the result¬ 
ing greenhouse effect started to warm the climate. 

There are many factors that seem to be crucial 
for life. Some of them may not look very impor¬ 
tant, but might still have made it impossible for 
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life to emerge. However, in many cases we don’t 
know how important they really are or if they are 
equally crucial for foreign life forms. 


21.5 Origin of Life 

One way to try to understand the origin of the ter¬ 
restrial life is to start with the available atoms and 
molecules and see if they could produce life. Dur¬ 
ing the last decades there has been considerable 
progress, but the process is very complicated and 
not yet well understood. Here we can only outline 
briefly how it might have happened. 

In a famous experiment in 1953 Harold Urey 
and Stanley Miller sent energy in the form of 
electric sparks through a gas mixture supposed 
to be similar to the early atmosphere of the 
Earth, containing methane, ammonia, hydrogen 
and water vapour. After a few days the solution 
contained several organic compounds, including 
some amino acids. At that time it was assumed 
that the early atmosphere was reducing. More re¬ 
cent studies suggest that this is not quite true, and 
the earliest atmosphere was rather neutral, con¬ 
taining mostly CO 2 , CO, N 2 , H 2 O and maybe 
some H 2 . Such an atmosphere would have pro¬ 
duced organic compounds much slower, if at all. 

Some amino acids have been found in me¬ 
teorites. Thus they seem to have been already 
present in the nebula from which the planetary 
system condensed. Complex organic molecules 
have been found also in interstellar molecule 
clouds (Sect. 15.3). There have even been claims 
of detecting the simplest amino acid, glycine, but 
the results are controversial. 

The next step, putting the basic blocks to¬ 
gether to form DNA or RNA molecules, is much 
more difficult. This looks like the chicken and 
egg paradox: the information contained in the 
DNA is needed to make proteins, and proteins 
are needed to catalyse the production of the nu¬ 
cleotides, which are the building blocks of the nu¬ 
cleic acids. So which came first? 

In the 1980’s Sidney Altman and Thomas 
Cech found that some RNA molecules can act as 
catalysts. Since RNA resembles DNA, it can store 
genetic material to some extent. Thus there is no 


need for the DNA and proteins. Even RNA frag¬ 
ments cannot be synthesised easily, but as they 
act as enzymes and can replicate, it is assumed 
that the initial chemical evolution first led to short 
and relatively simple RNA molecules. Eventually 
some of then combined to more complex ones, 
some of which were better adapted to the envi¬ 
ronment either by replicating faster or by being 
more durable. Thus the natural selection started 
to produce more complex molecules; this chem¬ 
ical evolution was working already before actual 
life emerged. 

The first cell-like structures could evolve 
from asymmetric molecules or lipids, one end 
of which attracts water and the other end re¬ 
pels water. In water such molecules tend to form 
bi-layered membranes where the hydrophilic or 
water-attracting end points outwards and hy¬ 
drophobic or water-repelling end inwards. Fur¬ 
ther on, such membranes form spontaneously 
spherical vesicles. If RNA happened to get in¬ 
side such a membrane, it may have been protected 
from the environment, and could have been con¬ 
tained within its own chemical environment. In 
some cases this could have improved its replica¬ 
tion, and thus led to further increase its concen¬ 
tration within the vesicle. 

A rather common assumption is that the first 
primitive life forms were RNA life. However, re¬ 
cently this theory has been challenged. RNA has 
some drawbacks. It is not as stable as DNA, and 
its replication is not as accurate as the protein me¬ 
diated replication of DNA. If the life was initially 
based on RNA, evolution led finally to the appear¬ 
ance of DNA molecules. Since DNA is superior 
to RNA due to its stability, it soon took over the 
role of information carrier. 

Currently the energy of sunlight is utilised 
by plants and some bacteria in photosynthe¬ 
sis, which produces carbohydrates from water 
and carbon dioxide. There are also organisms 
that do not need sunlight but can use chemi¬ 
cal energy to produce organic matter in a pro¬ 
cess called chemosynthesis. Such organisms have 
been found e.g. near hydrothermal vents on mid¬ 
ocean ridges (Fig. 21.3). These vents eject hot 
mineral-rich water to the ocean. Even though 
the temperature can be as high as 400 °C, the 
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Fig. 21.3 Black smokers in the Mid-Atlantic Ridge are 
hydrothermal vents that sprout hot mineral-rich water. 
(Photo: P. Rona, Credit: OAR/National Undersea Re¬ 
search Program (NURP); NOAA) 

high pressure prevents the water from boiling. 
Although this is too hot for life, there are re¬ 
gions around the vents where the temperature is 
suitable for such thermophiles. They could have 
been the first life forms, in which case life did not 
emerge in a Darwinian warm pond but in a hot 
pressure kettle. However, this is also a matter of 
debate. 

This kind of bottom-up approach tries to build 
life from the simple constituents already avail¬ 
able in the interstellar space. Another approach, 
the top-down method, tries to trace life back in 
time as far as possible. 

The oldest sediment rocks on the Earth, found 
in Isua in western Greenland, are 3.8 Ga old. 
Since they contain sediments, deposited by water, 
and pillow lavas, formed in water, the tempera¬ 
ture at that time could not have a value very dif¬ 
ferent from the current one. The solar luminosity 
was then lower than nowadays, but the difference 


was compensated by a higher amount of decaying 
radioactive materials and remanent heat of the re¬ 
cently born Earth. 

Oldest signs of life are almost as old. These 
signs are, however, just isotope ratios that can be 
interpreted as results of bacterial life. The carbon 
isotope 12 C is about 100 times as abundant as the 
heavier isotope 13 C. The lighter isotope is some¬ 
what more reactive and tends to be enriched in 
living organisms. In the Isua rocks there are sed¬ 
iments with a small excess of 12 C, which might 
indicate some kind of life. 

In the Warrawoona Group in Australia there 
are 3.5 Ga old formations that look like stro¬ 
matolites, mounds consisting of layers of mi¬ 
crobial cells and calcium carbonate. If they are 
real stromatolites, they may have been formed by 
cyanobacteria, but this is still a matter of debate. 

In the early times, at least for a billion years, 
photosynthesis was non-oxygenic. Cyanobacte¬ 
ria were possibly the first organisms capable of 
oxygenic photosynthesis. They started to produce 
oxygen, but initially it was dissolved in water and 
consumed in different oxidation reactions. Even¬ 
tually also the amount of atmospheric oxygen 
started to rise, and 2.2 Ga ago it reached 10 % 
of the current value, i.e. about 2 % of the total 
abundance in the atmosphere. 

First eukaryotes appeared in the fossil record 
2.1 Ga ago and multicellular organisms 1.5 Ga 
ago. The fossil evidence becomes much clearer 
towards the end of the Proterozoic era. The Edi- 
acara fauna, which is about 600 million years old, 
contains the oldest fossils of big and complex an¬ 
imals. These were softbodied animals. At the end 
of the Cambrian period 543 million years ago 
traces of the Ediacara fauna disappear and are re¬ 
placed by a huge variety of new animals, many 
with protecting shields. This increase in the vari¬ 
ety of life forms is called the Cambrian explosion. 

All life forms use similar genetic codes, which 
indicates that they have the same origin. This 
forefather of all life is called LUCA, the Last Uni¬ 
versal Common Ancestor. 

Relationships of living beings can be studied 
by comparing their DNA or RNA. The more the 
molecules of two species differ, the more distant 
the species are in the evolutionary sense. These 
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Fig. 21.4 A simplified 
phylogenetic tree. 

A branch is the older the 
closer it is to the last 
common ancestor, LUCA. 
(Adapted from Webb: 
Where is everybody ?) 



distances can be plotted as a map, called the phy¬ 
logenetic tree (Fig. 21.4). 

The phylogenetic tree, as we now know it, has 
three branches, the domains of Archciea, Bacte¬ 
ria and Eukarya. The organisms closest to the 
root are thermophiles that live close to hydrother¬ 
mal vents or in hot water. Obviously, the LUCA 
lived in such a hot environment. However, RNA 
molecules do not remain intact in such hot en¬ 
vironments. If the earliest life was RNA life, it 
would have evolved in a cooler environment. Cur¬ 
rently we do not know the real birthplace of life. 

Although the phylogenetic tree points to a 
common origin, there may have been other starts, 
too, but natural selection has eliminated the other 
ones that were less competitive. 


21.6 Are We Martians? 

Mars and the Earth have very similar rotation pe¬ 
riods and axial tilts. Since the notorious ‘discov¬ 
ery’ of canals, Mars and its inhabitants have been 


a popular subject in science fiction stories. Later 
observations revealed a very thin atmosphere and 
low temperature, which make Mars a rather hos¬ 
tile place. Finally, the Viking landers showed 
a marred planet. Yet the possibility of some sim¬ 
ple life forms cannot be excluded. 

In 1984 a meteorite was found in the Allan 
Hills region in Antarctica and labelled as ALH 
84001 (Fig. 21.5). The piece of rock was esti¬ 
mated to be 3.9 Ga old. Chemistry of the mete¬ 
orite shows that it had originated on Mars; an im¬ 
pact had thrown it to an orbit that brought it to the 
Earth. 

In 1996 a group of NASA scientists an¬ 
nounced that the meteorite contained structures 
resembling fossilised microbes and compounds 
that could be products of living organisms, such 
as polycyclic aromatic hydrocarbons (or PAH) 
and magnetite. However, they can be produced by 
other processes, too. Not surprisingly, the results 
and the implications of Martian life caused a lot 
of scepticism. Only further Mars expeditions and 
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Fig. 21.5 The meteorite 
ALH 84001 found in 
Antarctica has a chemical 
composition that indicates 
its Martian origin. The 
meteorite contains 
substances that can be 
produced by living 
organisms. The wormlike 
structures resemble 
bacteria. It is, however, 
reasonable to ask whether 
one single sample can 
prove that once there was 
life on Mars. (NASA) 



possible in situ experiments can decide whether 
there has been life on Mars. 

In case there has really been life on Mars, there 
are several possibilities: 

• Life originated independently on the Earth and 
Mars. 

• Life originated only on the Earth and was then 
transported to Mars. 

• Life originated only on Mars and was trans¬ 
ported to the Earth. 

It seems that life on the Earth emerged almost 
as soon as the conditions became favourable. It 
has been argued that the life appeared even too 


quickly. This problem would be solved if life 
originated on Mars. The surface of the more dis¬ 
tant and smaller Mars had cooled down faster 
to become habitable before the Earth. Thus life 
would have had more time to evolve on Mars, 
and was transferred to the Earth when conditions 
here became suitable. Thus our earliest ancestors 
could be Martian bacteria. Presently such consid¬ 
erations are, however, just speculations. 

The idea of life spreading from one celestial 
body to another is known as panspermia. The 
idea dates back to the antiquity, but its first seri¬ 
ous advocate was the Swedish chemist Svante Ar- 
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rhenius, who published a book on the subject in 
1908. Of the later proponents, Sir Fred Hoyle was 
the most famous. Panspermia fit well to his cos¬ 
mology: the universe had no beginning, neither 
did life, but had always existed. Thus the tough 
problem of the origin of life was neatly avoided. 

Now panspermia, in a certain more limited 
sense, begins to seem a little more possible the¬ 
ory. Primitive life forms can survive inside me¬ 
teorites in the coldness and lethal radiation of 
the interplanetary space long enough to travel 
from one planet to another. Interstellar distances, 
though, are too long, and the probability of a me¬ 
teoroid from one planetary system hitting another 
system is too low. It seems obvious that our life 
has originated here in our own solar system. 


21.7 Life in the Solar System 

Once there may have been life on Mars. Although 
probes have not detected signs of life, it is not 
impossible that there might still be some mi¬ 
croscopic life, but we cannot expect to find any 
macroscopic life forms. The same is true for other 
places in our solar system. 

Mercury has no atmosphere, Venus is too hot, 
and the giant planets have no solid surface. Be¬ 
sides the Earth and Mars this leaves only some 
satellites as possible habitats. It has also been 
speculated that there might be living things float¬ 
ing in the atmospheres of the giant planets, but 
the emergence and evolution of such things seems 
rather improbable. 

Europa as well as some other icy satellites 
are nowadays considered potential places for life. 
Their surfaces are too cold, but tidal heating 
keeps the interiors warm enough. Big satellites 
rotate synchronously, but if the orbit is not per¬ 
fectly circular, the orbital velocity varies accord¬ 
ing to Kepler’s second law. Thus the satellite li- 
brates just like our Moon, which means that the 
direction of the tidal distortion keeps changing. 
Also the distance to the planet varies, and there¬ 
fore the magnitude of the tidal force varies, too. 
These effects deform the satellite continuously 
giving rise to tidal heating. 

Europa’s surface is covered by ice. In some 
places the ice cover is broken into plates that ob¬ 


viously have moved relative to each other. The ro¬ 
tation period of the surface differs from the rota¬ 
tion period of the magnetic field, which is frozen 
to the interior. Observations seem to indicate that 
the ice cover floats on an ocean. 

The illumination under the ice is too dim for 
photosynthesis, but there might be thermal vents 
as in the oceans of the Earth. Hence the ocean 
could be a habitat for microbes that can utilise 
the thermal energy. 

Titan is the only satellite with a thick atmo¬ 
sphere. The atmosphere is also rich in organic 
compounds, like methane. Methane dissociates 
rapidly, and thus the high methane content means 
that there must be a source of new methane. Liv¬ 
ing organisms are one such source, but because 
of its coldness Titan does not look like a promis¬ 
ing place for life. A more plausible theory was 
that there was a methane ocean on Titan but the 
Huygens probe revealed a rather dry landscape, 
which, however, has signs of liquid flows. Radar 
images sent by the Cassini probe show some dark 
areas that might be methane lakes (Fig. 8.25). 


21.8 Detecting Life 

If we find a potentially habitable planet, is there 
any hope that we can see if life has emerged on 
it? The question can be answered by studying 
whether we can find life on the Earth from satel¬ 
lite observations. In 1990 the Galileo probe made 
just such experiments, and it seems that it is in¬ 
deed possible to detect life, at least the kind of 
life we have on the Earth. Similar observations of 
the Moon showed no traces of life. 

The detection is based on spectroscopic obser¬ 
vations that can reveal some signatures of life. 
These signatures are in the infrared part of the 
spectrum, thus requiring observations made out¬ 
side the atmosphere. 

There are two emission features that are strong 
indicators of life, ozone and methane. Photosyn¬ 
thesis is the most probable source of molecular 
oxygen, which is then broken into two oxygen 
atoms by ultraviolet radiation. The free oxygen 
atoms join to molecules to form ozone. Methane 
is also produced by living beings. It is quickly ox¬ 
idised, and has to be continuously replenished to 
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keep the level noticeable. However, there can be 
large reservoirs of methane, particularly in cold 
environments. Thus methane itself is not a sign 
of life, but if it is found together with ozone, the 
evidence becomes more convincing. 

Another feature is the infrared reflectance 
spectrum of the green plants. Chlorophyll absorbs 
visible light, particularly blue and red, but there 
is a distinct cutoff called the red-edge, seen as 
a steep gradient of the spectrum between 690 and 
740 nm. Longer wavelengths are very effectively 
reflected to avoid excessive heating. 


21.9 SETI—Detecting Intelligent Life 

Mankind has been sending radio transmissions 
for almost a century. Our radio signals are now 
filling a sphere with a radius of almost one 
hundred lightyears. Another civilisation orbiting 
a nearby star might be able to pick up this trans¬ 
mission with a big radio telescope. Such leakage 
radiation is, however, very weak. Sensitivity of 
our own radio receivers has increased enormously 
since they were invented. Thus it has been possi¬ 
ble to reduce the power of the transmitters, and 
the signals leaking to space have become weaker. 
Also, more and more signals are sent in cables 
and optical fibres. If another civilisation has un¬ 
dergone similar development, detecting leakage 
signals is extremely difficult. Chances are much 
better if the signal has been sent intentionally to¬ 
wards potential receivers in the hope that some¬ 
body will detect it. 

One might think that galaxies and star clusters 
are worth listening, since there are many stars in 
the narrow beam of the telescope. Unfortunately, 
it is not quite so. Other galaxies are so far away 
that the signal could be too weak to be detected. 
Globular clusters consist of very old stars with 
low metallicity. Thus the probability of finding 
a habitable planet is tiny indeed. Open clusters 
are relatively young, and life may not have had 
time to evolve to a communicating civilisation. 
Thus galaxies and star clusters are not the best 
places to search for signs of life. 

What frequency should we use? If the sending 
and receiving party have developed radio astron¬ 
omy, they must be aware of certain common fre¬ 
quencies, like the hydrogen 21 cm radiation. Such 


a wavelength itself may not be a good choice be¬ 
cause of the background noise, but some of its 
multiples or a sum of two common frequencies 
might fall in the quiet part of the radio spectrum. 
A good frequency could be the H 2 O maser emis¬ 
sion at 22 GHz. Around this frequency the sky 
is pretty quiet except for a few sources. But it 
is not enough to listen to those frequencies only, 
since they are Doppler shifted due to the relative 
motion of the transmitter and receiver. And if the 
transmitter and/or receiver are on planets orbiting 
a star, the Doppler shift will change periodically. 
Fortunately, current receivers are capable of fol¬ 
lowing millions of frequencies simultaneously. 

Radio emission from natural sources can be 
steady noise or vary in a periodic, quasiperiodic 
or chaotic manner. If we want to send a signal to 
be recognised as artificial, it should contain a pat¬ 
tern that cannot arise naturally. It could e.g. con¬ 
tain an increasing number of pulses representing 
the first few prime numbers. 

Although most SETI research concentrates on 
radio frequencies, also optical wavelengths have 
recently been considered seriously. Pulsed lasers 
pack a lot of energy to a short pulse (lasting typi¬ 
cally one nanosecond) confined to a very narrow 
wavelength band and a narrow beam. The flash 
can be even brighter than the central star. If such 
a signal were pointed towards us, it should be rel¬ 
atively easy to detect. Such optical SETI research, 
or OSETI, has already been started, but it is still 
behind the radioastronomical SETI. 

In 1974 Frank Drake used the Arecibo radio 
telescope to send a message towards the globular 
cluster M13 (which is not a good place for life). 
The message contained 1679 pulses (Fig. 21.6). 
This number has exactly two factors, 23 and 73. 
Thus the receiver, who obviously must under¬ 
stand some mathematics, could guess that the 
message contains a two-dimensional picture. If 
we ever detect such a message, we can be pretty 
certain of its artificial origin, even if we were not 
able to interpret the message. 

The first serious SETI project (Search for Ex- 
traTerrestrial Civilisations) was carried out in 
1960, also by Frank Drake. This project Ozma 
observed two nearby stars, r Cet and s Eri, at the 
21 cm wavelength. Since then radio technology 
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Fig. 21.6 The signal sent with the Arecibo radio tele¬ 
scope consists of 1679 pulses. The number 1679 has only 
two factors, 23 and 73, so obviously the signal represents 
a twodimensional image of 23 x 73 pixels. A lot of in¬ 
formation about the sender is coded in the image, but it 
is difficult to say whether the receiving civilisation could 
interpret it. In any case, the artificial origin of the signal is 
indisputable 

has improved enormously, and currently we are 
able to listen to a huge number of channels at the 
same time. 

There are two basic search strategies. In a tar¬ 
geted search we listen to a few objects that are po¬ 
tential candidates for harbouring life. In a wide- 
sky survey large areas of the sky are scanned. 
Most current projects belong to the latter cate¬ 
gory. 


Observing time with large telescopes is ex¬ 
pensive, and priority is given to projects that can 
be expected to produce positive results. In some 
projects, like SERENDIP, this problem is avoided 
by having the receiver sit piggyback on some 
other instrument and listen to whatever the tele¬ 
scope happens to be observing. Thus the SETI 
project does not need any dedicated time of its 
own. The drawback is, of course, that many of 
the target areas may not be interesting in the SETI 
sense. 

Detecting a potentially artificial signal among 
the huge amount of data requires a lot of comput¬ 
ing power. The seti@home project has connected 
millions of computers to an enormous virtual ma¬ 
chine to analyse the data. Anyone with a com¬ 
puter connected to the Internet can load a screen¬ 
saver program that will automatically fetch pack¬ 
ets of data and send back the results. 

Thus far not a single message sent by another 
civilisation has been confirmed. There have been 
some interesting cases, but they have been single 
bursts of unknown origin. They have not been de¬ 
tected later, not even with more sensitive instru¬ 
ments. 


21.10 Number of Civilisations 

Although no extraterrestrial civilisation has been 
found, we can try to estimate their number. The 
SETI pioneer Frank Drake suggested a formula 
for calculating the number of civilisations in the 
Milky Way capable of communication at a given 
instant: 

N = R x / p x / h x f x f x / c x L, (21.1) 

where N is the total number of communicating 
civilisations in the Milky Way, R is the annual 
birth rate of stars, / p is the fraction of stars pos¬ 
sessing planets, /, is the fraction of planets be¬ 
ing habitable, f\ is the fraction of habitable plan¬ 
ets having some kind of life, /; is the fraction of 
these planets having intelligent life, / c is the frac¬ 
tion of intelligent civilisations that have devel¬ 
oped means for interstellar communication, and 
finally L is the time in years that such a civilisa¬ 
tion has been communicating. All the /-factors 
are probabilities that are in the range [0, 1]. 
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The astronomical factors ( R , / p and /j,) are 
the only ones that are known with any accuracy. 
The biological factors, f\ and f. involve a lot of 
guesswork. The last two factors, / c and L, are 
even harder, since they are related to the socio¬ 
logical behaviour of the exocivilisation. 

Actually, the formula was intended as the ba¬ 
sis of the agenda of an influential SETI meeting 
held in Green Bank in 1961. It splits the prob¬ 
lem nicely into smaller subproblems that can be 
discussed separately. But using the formula to 
find the actual number of civilisations is not very 
meaningful, since so many of the factors are to¬ 
tally unknown. In the most “optimistic” case we 
could find out that the distance between neigh¬ 
bouring civilisations is just a few parsecs, but giv¬ 
ing the probabilities small (possibly more realis¬ 
tic) values, their product might be so minute that 
we ought to be alone in the Milky Way. At least 
the formula shows how little we know. 

Even if favourable conditions and evolution of 
communicating civilisations were relatively com¬ 
mon, the last factor may turn out to be the limiting 
one. If the lifetime of a civilisation is short com¬ 
pared to the age of the universe, the chances of 
hearing a message from another star are poor. 

Earlier many astronomers seemed to think that 
exocivilisations would not be that rare, while bi¬ 
ologists showed that the evolution of life had so 
many obstacles that we should not expect to find 
other civilisations in our neighbourhood. Now we 
understand better both the biochemistry of early 
life and the many problems in having a habitable 


planet. Although opinions vary considerably, we 
might guess that very simple microbe-like life is 
relatively common, but intelligent, communicat¬ 
ing beings might be extremely rare. 


21.11 Exercises 

Exercise 21.1 Calculate the limits of the habit¬ 
able zone of the Sun assuming the planet is a fast- 
rotating blackbody with a Bond albedo of 0.3. 
What is the continuously habitable zone, if the lu¬ 
minosity of the Sun was originally 0.7 times the 
current value? 

Exercise 21.2 Assume there are n stars in a cu¬ 
bic parsec and a fraction p of them have com¬ 
municating civilisations. What is the average dis¬ 
tance between two neighbouring civilisations? 
Apply the result to the solar vicinity. The stellar 
density can be estimated from Table C.17. What 
is the average distance between nearest civilisa¬ 
tions, if the probability of a star having a planet 
with a civilisation is (a) 0.01, (b) 0.00001? 

Exercise 21.3 An asteroid with a diameter of one 
hundred metres is approaching the Earth. Esti¬ 
mate the minimum value of the kinetic energy re¬ 
leased in the collision. Compare the result with 
the Hiroshima atomic bomb. The energy of the 
bomb was equivalent to 15 kilotons of TNT. One 
ton of TNT corresponds to the energy of 4.184 x 
10 9 joules. 
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The Copernican principle means that the Earth 
has gradually lost its special status: it is just a 
planet orbiting a relatively common star in an or¬ 
dinary galaxy. But how common planetary sys¬ 
tems are? Nowadays we think that they belong to 
the normal life of a star. However, it is difficult 
to make ground based observations of exoplan¬ 
ets, i.e. planets orbiting some other star than our 
Sun. The light reflected by the planet is so dim 
that it is swamped with the light of the star. The 
situation is, however, improving thanks to evolv¬ 
ing observation methods. 


22.1 Other Planetary Systems 

At the beginning of 2016 over 1300 planetary 
systems and a total of over 2000 planets had been 
found. The numbers have increased rapidly dur¬ 
ing the last few years. It seems that the birth of 
planetary systems is more and more closely re¬ 
lated to the birth process of stars. 

Due to the observing methods most of the 
known exoplanets are rather massive. Recently 
also planets about the size of the Earth have been 
detected. The next big question is if any of them 
is suitable for harbouring life. 

Origin of planetary systems have been studied 
using theoretical calculations. Most calculations 
have dealt with single stars around which stable 
planetary orbits are easier to find. However, stable 
orbits are also possible around wide binary stars 
either close to one of the components, or so far 
that the planet orbits both components. 


The Milky Way contains about 10 11 stars. 
Maybe 10 9 -10 10 of them have suitable condi¬ 
tions for a planetary system. 

The first exoplanet was found in 1992. It was 
quite a strange object, orbiting a pulsar. Pulsars 
are final stages of stellar evolution, and planets 
orbiting the star should be destroyed. 

Some earlier observations had given hints 
about exoplanets but they were not conclusive. 
The first exoplanet orbiting an ordinary star was 
found in 1995 around /I Pictoris. 

In addition to planets orbiting a star there are 
some indications of rogue planets that have es¬ 
caped from planetary systems. Star density in star 
birth regions can be so high that the perturba¬ 
tions of other stars throw planets away from their 
original orbits. Such runaway planets are even 
more difficult to detect than planets orbiting other 
stars. 


22.2 Observational Methods 

Although only a few planets can be observed 
directly with current methods, the existence of 
planets can be deduced using several indirect 
methods. 

Astrometric methods are based on perturba¬ 
tions of the proper motion or radial velocity of 
the star. If the mass of the star is big enough the 
centre of mass will be clearly displaced from the 
star. For example, the centre of mass of our solar 
system is outside the surface of the Sun, and the 
Sun as well the planets orbit around this point. 
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Therefore the motion of the Sun through space is 
not linear but shows particularly the 12 year pe¬ 
riod of Jupiter’s orbital motion. 

Earliest observations were based on similar as¬ 
trometric measurements as with binary stars. The 
wobble of the star caused by a planet is so minute 
that the results were not conclusive. The advan¬ 
tage of the method is that it does not depend on 
the angle between the orbital plane and the line of 
sight. 

Much more precise results have been obtained 
by studying the periodic changes of the radial ve¬ 
locities of stars using the Doppler effect. The ra¬ 
dial velocities of many nearby stars show indeed 
periodic variations indicating that they have mas¬ 
sive invisible companions. The method makes 
it possible to detect Jupiterlike planets around 
nearby planets, but smaller ones are too light to 
cause detectable variations in the radial veloc¬ 
ity of the star. This is why most of the currently 
known exoplanets are more massive than Jupiter. 

This method works the better the closer the ob¬ 
server is to the orbital plane of the orbit. If the or¬ 
bital plane is almost perpendicular to the line of 
sight no Doppler shift cannot be seen. 

A third method is based on transits of plan¬ 
ets: when a planet is between the star and the ob¬ 
server it occults a part of the stellar disk. Such 
events have already been observed with ground 
based telescopes, but especially the Kepler space¬ 
craft sent by NASA in 2009 has investigated them 
by following about one hundred thousand stars in 
the galactic habitable zone. The method works, 
of course, only if the observer is very close to the 
orbital plane of the planet. Yet over a thousand 
planets have been found, many of which are about 
the size of the Earth. Planetary transits are seen 
as very regular dimming of the star. At the same 
time the satellite has observed also other kinds of 
brightness variations, thus advancing studies of 
variable stars. 

A fourth method is gravitational lensing. If a 
star passes in front of a background star its grav¬ 
ity will bend the light, and the background star 
will look brighter. If the front star has planets the 
brightening is slightly stronger than without plan¬ 
ets. 

Also rogue planets can be detected with the 
transit or gravitational lens methods. Then the 


brightness will change only once and for a very 
short period. Interpretation of such events is prob¬ 
lematic: is it really a planet or a phenomenon of 
the star itself. 

A fifth method is timing, which is well applied 
to regularly variable stars. A planet causes the 
distance of the star vary slightly, which changes 
the periodicity of the brightness variation of the 
star in a regular manner. 

This was the method that gave the first con¬ 
vincing evidence of exoplanets in 1992 when the 
pulsation of the pulsar PSR B1257+12 showed 
regular variation. The discovery was strange 
since a pulsar should not have planets. The ob¬ 
jects may be second generation planets that con¬ 
densed from the material ejected by a supernova 
explosion. Later the same method has also re¬ 
vealed some planets around variable stars. 

Direct imaging of exoplanets is not yet an ef¬ 
fective search method. The first images show ob¬ 
jects whose existence has already been confirmed 
with other methods (Fig. 22.1). 


\ 

\ 


Fig. 22.1 A detailed image of the circumstellar disk 
around fi Pictoris was obtained with the ESO ADONIS 
adaptive optics system at the 3.6 m telescope at La Silla, 
Chile, and the Observatoire de Grenoble coronagraph at 
the wavelength of 1.25 microns in 1996. The full extent 
of the disk is about 1500 AU. The area around the star 
masked by the coronagraph is only 24 AU (correspond¬ 
ing to a distance from the Sun to halfway between Uranus 
and Neptune). No planets are directly seen but their grav¬ 
itational effects can be detected as a bending of the main 
plane in the inner part of the disk. (ESO) 
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Also young recently born stars can indicate 
planetary systems. Many stars are surrounded by 
a dust disk (Fig. 22.2). The easiest way to explain 
the infrared radiation of some young stars is to 
assume a massive disk surrounding the star and 
interacting with the star. Later the disk can evolve 
to a planetary system. Such disks have been de¬ 
tected around e.g. Vega and (3 Pictoris. 



The Brown Dwarf 2M1207 and its Planetary Companion 
(VLT/NACO) 


ESO PH Pluto 14*05 <30 Ajjfll 2005) cLSO WCM 

Fig. 22.2 The first image of an exoplanet was obtained 
in 2004 with the VLT telescope. The mass of the planet is 
about five times the mass of Jupiter and it orbits a brown 
dwarf at a distance of 55 AU. (ESO) 


22.3 Properties of Exoplanets 

For practical reasons it is easiest to find big plan¬ 
ets orbiting close to a star. They affect the posi¬ 
tion of the star most strongly, and due to the short 
orbital period the periodicity of the variations is 
most easy to detect. Therefore many of the exo¬ 
planets are such “hot jupiters”. This is seen in the 
distribution of Fig. 22.3. A great majority of the 
exoplanets found this far are clearly more mas¬ 
sive than the Earth. This is another example of a 
statistical bias caused by a selection effect typ¬ 
ical in astronomy. There may be a lot of Earth¬ 
sized planets but they are extremely difficult to 
observe. 

The planetary orbits of our own solar system 
are nearly circular. If Mercury is excluded the 
eccentricity of every orbit is less than 0.1. Or- 



eccentricity 


Fig. 22.4 Distribution of the eccentricities of the known 
exoplanets 


Fig. 22.3 Orbits and 
masses of some extrasolar 
planets. Open circles are 
planets orbiting a pulsar. 
Planetary masses are given 
in Jupiter’s masses. The 
figure contains some 1200 
planets that were known 
reasonably well at the 
beginning of 2016. 
Actually, a lot more planets 
have been detected, but 
their data are yet quite 
uncertain 
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bits of the exoplanets, on the other hand, can be 
very eccentric (Fig. 22.4). Theories concerning 
the evolution of planetary systems must be able to 
explain how the eccentricities decrease by, for ex¬ 
ample, collisions of minor bodies or mutual per¬ 
turbations of planets. 

When also solid earthlike planets can be ob¬ 
served the next question is, if any of them has an 
environment suitable for life. After that we can 
investigate if their spectra show features typical 
for life mentioned in the previous chapter. 


22.4 Exercises 

Exercise 22.1 How much the apparent place of 
the Sun oscillates due to Jupiter’s orbital motion 
as seen from the distance of 10 parsecs. 

Exercise 22.2 A distant observer detects the 
brightness of the Sun changing when Jupiter tran¬ 
sits the Sun. How big is this change in magni¬ 
tudes? And what is the change when the Earth 
transits the Sun? 



Mathematics 


A 


A.1 Geometry 

Units of Angle and Solid Angle Radian is the 
angular unit most suitable for theoretical studies. 
One radian is the angle subtended by a circular 
arc whose length equals the radius. If r is the ra¬ 
dius of a circle and s the length of an arc, the arc 
subtends an angle 


surface of a sphere with radius r subtends a solid 
angle 

co = A/r 2 . 

Since the area of the sphere is Anr 2 , a full 
solid angle equals An steradians. 

Circle 


a — s / r. 


Area A — nr 2 . 



2jrrad=360° or 1 rad = 180°/7T. Sphere 



In an analogous way we can define a stera- 
dian, a unit of solid angle, as the solid angle sub¬ 
tended by a unit area on the surface of a unit 
sphere as seen from the centre. An area A on the 


Area A = Anr . 


Volume V = -nr . 


Volume of a sector \4 = —nrh 


= ~^nr\\ - cosa) = V sp here hava. 

Area of a segment A s = 2nrh = 2nr 2 (\ — cosa) 

= A sphere hav a. 
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Equation in polar coordinates 
P 

r = 7--7> 

1 + e cos/ 

where the distance r is measured from one focus, 
not from the centre. 

When e — 0, the curve becomes a circle. 


A.2 Conic Sections 

As the name already says, conic sections are 
curves obtained by intersecting circular cones 
with planes. 


Hyperbola Equations in rectangular and polar 
coordinates 



r = 


P 

1 + ecosf 



Ellipse Equation in rectangular coordinates 


2 2 

- + - = 1 
a 2 b 2 



a — the semimajor axis, 
b — the semiminor axis b — c/V 1 — e 2 , 
e = eccentricity 0 < e < 1. 

Distance of the foci from the centre c = ea. 
Parameter (semilatus rectum) p — a( 1 — e 2 ). 
Area A — nab. 



Eccentricity e > 1. 

Semi-minor axis b — a-Je 2 — 1. 
Parameter p = a (e 2 — 1). 
Asymptotes y = ±-x. 


Parabola Parabola is a limiting case between 
the previous ones; its eccentricity is e = 1. 
Equations 


x — —ay~ 


1 + cos / 



Distance of the focus from the apex h — 1/4 a. 
Parameter p= 1/2 a. 
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A.3 Taylor Series 

Let us consider a differentiable real-valued func¬ 
tion of one variable / : R —»■ R. The tangent to 
the graph of the function at xo is 

y = /(-*o) + /'(•* o)(x - xo), 

where f(x o) is the derivative of / at xq. Now, 
if x is close to xo, the graph of the tangent at x 
will not be very far from the graph of the function 
itself. Thus, we can approximate the function by 


1 — x 


= 1 + X + x 2 + x 3 + ■ 


/- 1 1 , 1 , 

,- 1 1 , 1 , 

VI— x = \ -x-x- x — 

2 8 16 

1 , 1 3 2 5 3 

, = l — -X + -x — — x — 

Vl + x 2 8 16 

^ =1 + I r+ 3 r 2 + _5_ ^3 

VT^V 2 8' 16' 


fix) « fix o) + f'ixo)ix - xo). 

The approximation becomes worse, the more the 
derivative f' varies in the interval [xo,x]. The 
rate of change of f is described by the second 
derivative f", and so on. To improve accuracy, 
we have to also include higher derivatives. It can 
be shown that the value of the function / at x is 
(assuming that the derivatives exist) 

fix) = fix o) + fix o)(x - x 0 ) 

+ \f"ix o)(x -x 0 ) 2 H- 

+ — f n) ix o)(x - xo)" H-, 

n\ 

where / '"1 (xo) is the n th derivative at xo and n ! is 

the n -factorial, n\ = 1-2-3 . n. This expansion 

is called the Taylor series of the function at xq- 



The following list gives some useful Taylor se¬ 
ries (in all these cases we have xq = 0): 


--= 1 - x + x 2 - x 3 H- 

1 + x 


converges if |x| < 1 


e* = i + x+Ix 2 +ix 3 + --. + ix n 

2! 3! n\ 

+ ■ ■ ■ converges for all x 
1 , 1 , 1 4 

ln(l + x) = x — -x + -x J — -x H- 

x e (-1,1] 


1 3 1 5 

sinx = x — —x + —x 

1 2 1 4 

cosx = 1-x H-x - 

2! 4! 


for all x 

for all x 


1 3 2 5 , , n 

tanx = x +-x + —x H- \x\ < —. 


Many problems involve small perturbations, in 
which case it is usually possible to find expres¬ 
sions having very rapidly converging Taylor ex¬ 
pansions. The great advantage of this is the re¬ 
duction of complicated functions to simple poly¬ 
nomials. Particularly useful are linear approxima¬ 
tions, such as 


Vl +x « 


1 

l + 2-v. 


1 

Vl + X 



etc. 


A.4 Vector Calculus 

A vector is an entity with two essential proper¬ 
ties: magnitude and direction. Vectors are usually 
denoted by boldface letters a, b , A , B etc. The 
sum of the vectors A and B can be determined 
graphically by moving the origin of B to the tip of 
A and connecting the origin of A to the tip of B. 
The vector —A has the same magnitude as A, is 
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parallel to A, but points in the opposite direction. 
The difference A — B is defined as A A (— B). 



is 

A + B = (a x + b x )i + (a y A b y )j A (a z A b z )k. 

The magnitude of a vector r in terms of its com¬ 
ponents is 

r= \r\ =Jx 2 + y 2 + z 2 . 

The scalar product of two vectors A and B is 
a real number (scalar) 


Addition of vectors satisfies the ordinary rules 
of commutativity and associativity, 

A+ B = B + A, 

A + (B + C) = (A + B) + C. 



A point in a coordinate frame can be specified 
by giving its position or radius vector , which ex¬ 
tends from the origin of the frame to the point. 
The position vector r can be expressed in terms 
of basis vectors , which are usually unit vectors , 
i.e. have a length of one distance unit. In a rectan¬ 
gular vy"-frame, we denote the basis vectors par¬ 
allel to the coordinate axes by i, j and k. The po¬ 
sition vector corresponding to the point (x, y, z) 
is then 

r = xi A yj + zk. 

The numbers x, y and z are the components of r. 
Vectors can be added by adding their compo¬ 
nents. For example, the sum of 

A —= a x i A a yj A a-k , 

B — b x i + by j A b-k. 


A - B — a x b x Aa y b y Aa z b z — |A||B| cos(A, B), 

where (A, B) is the angle between the vectors A 
and B. We can also think of the scalar product 
as the projection of, say, A in the direction of B 
multiplied by the length of B. If A and B are 
perpendicular, their scalar product vanishes. The 
magnitude of a vector expressed as a scalar prod¬ 
uct is A — | A| = VA ■ A. 

The vector product of the vectors A and B is 
a vector 

A x B — (Uyb~ a z b y )i A {a z b x a x b z ) j 
+ (a x b y - a y b x )k 

i j k 

fly fl^ • 

b x by b z 

This is perpendicular to both A and B. Its length 
gives the area of the parallelogram spanned by 
A and B. The vector product of parallel vec¬ 
tors is a null vector. The vector product is anti- 
commutative: 


A x B — —B x A. 



Scalar and vector products satisfy the laws of 
distributivity: 

A(BAC) = ABAAC , 








A.5 Matrices 


467 


Ax(B + C) = AxB + AxC, 
(.A + B) C = A C + B C , 

(A + B)xC — AxC + BxC. 


A scalar triple product is a scalar 


Ax B C 


Cl% Cly 

b x by b z 

c x c y c z 


Here the cross and dot can be interchanged and 
the factors permuted cyclically without affecting 
the value of the product. For example Ax B C = 
B x C A = B C x A.butAx B C = -B x 
AC. 

A vector triple product is a vector, which can 
be evaluated using one of the expansions 


A x (B x C) — B{A C) - C(A B), 
(A x B) x C — B(A C) - A(B C). 



When computing a derivative of a vector product, 
one must be careful to retain the order of the fac¬ 
tors, since the sign of the vector product changes 
if the factors are interchanged. 


A.5 Matrices 

Assume we have a vector x with components 
(x, y, z). We can calculate another vector x' — 
( x',y',z'), the components of which are linear 
combinations of the original components: 


In all these products, scalar factors can be 
moved around without affecting the product: 

A kB — k(A ■ B), 

Ax (B x kC) = k(A x{B x C)). 

The position vector of a particle is usually 
a function of time r — r(t ) = x(t)i + y(t)j + 
z(t)k. The velocity of the particle is a vector, 
tangent to the trajectory, obtained by taking the 
derivative of r with respect to time: 


x' — anx + any + anz, 

y' — a 2 \x + an y + 0232 , 

z' = 031* + 032 y + 033Z. 

This is a linear transform that maps the vector x 
to a vector x’. 

We can collect the coefficients to an array, 
called a matrix A: 

( an a 12 oi 3 
021 022 023 

031 032 033 



Q /V /V /V 

v — —r(t ) — r— xi + yj + zk. 
at 

The acceleration is the second derivative, r. 

Derivatives of the various products obey the 
same rules as derivatives of products of real¬ 
valued function: 

d 

— (A B) = A ■ B + A B, 
dr 

d 

— (A x B) — A x B + A x B. 
dr 


A general matrix can consist of an arbitrary num¬ 
ber of rows and columns. In this book we need 
only matrices operating on vectors of a three- 
dimensional space, and they always have three 
rows and columns. Two subscripts refer to the dif¬ 
ferent elements of the matrix, the first one giving 
the row and the second one the column. 

When using matrix formalism it is convenient 
to write vectors in the form of column vectors: 
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We now define that the product of a matrix and 
a column vector 


the previous example in the reverse order, we get 
quite a different result: 


or 



x' 

= Ax 


011 

a 12 

013 

«21 

022 

023 

+31 

an 

033 



/I 2 OWl 1 1\ /I 3 5 

2 1 10 1 2 = 3 5 7 

\1 3 2/ \1 2 3/ \3 8 13 

A unit matrix I is a matrix, which has ones on 
its diagonal and zeros elsewhere: 



means just 


x' — a\\x + any + 013 + 

y' — aiix + aiiy + 023 + 
z! — cit,\x + any + anz. 

Comparing these equations we see, for example, 
that the first component of x' is obtained by tak¬ 
ing the first row of the matrix, multiplying the 
components of the vector x by the correspond¬ 
ing components of that row, and finally adding 
the products. 

This definition can easily be generalised to the 
product of two matrices. The elements of the ma¬ 
trix 


C — AB 


/ = 



If a vector or a matrix is multiplied by a unit ma¬ 
trix, it will remain unchanged. 

If the product of two matrices is a unit matrix, 
the two matrices are inverse matrices of each oth¬ 
ers. The inverse matrix of A is denoted by A -1 . 
It satisfies the equations 

A~ l A — AA _I = /. 


In spherical astronomy we need mainly rota¬ 
tion matrices, describing the rotation of a coordi¬ 
nate frame. The following matrices correspond to 
rotations around the x, y and z axes, respectively: 


are 

c ij—Qjkbkj ■ 
k 

This is easy to remember by noting that we take 
the row i of the first factor A and the column j of 
the second factor B and evaluate the scalar prod¬ 
uct of the two vectors. For example 

1 1 lWl 2 0 

0 1 2 II 2 1 1 

1 2 3/ \1 3 2 

/1+2+1 2+1+3 0 + 1 + 2\ 

=0+2+2 0+1+6 0+1+4 
\ 1 +4 + 3 2 + 2 + 9 0 + 2 + 6/ 

/ 4 6 3 

= 4 7 5 

\8 13 8 

When multiplying matrices, we have to be 
careful with the order of the factors, because usu¬ 
ally AB ^ BA. If we multiply the matrices of 



R x (a) = 


*v(«) = 


Rz(<*) = 


0 cos a 

sin a 

0 — sin a 

COS O' 

COS Oi 

0 

sin a' 

0 

1 

0 

— sin a 

0 

COS O' 

cos a 

sin a 0 

— sin a 

cos a 0 

0 


0 1 


If the angle is a = 0, only a unit matrix remains. 

The elements of a rotation matrix can easily 
be determined. For example, a rotation around the 
x axis will leave the x coordinate unaffected, and 
thus the first row and column must be zeroes, ex¬ 
cept for the diagonal element, which must be one. 
This will leave four elements. When the angle is 
zero, the matrix has to reduce to a unit matrix; 
thus the diagonal elements must be cosines and 
the other ones sines. The only problem is to de¬ 
cide, which of the sines will get the minus sign. 
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This is most easily done by testing the effect of 
the matrix on some basis vector. 

The inverse matrix of a rotation matrix cor¬ 
responds to a rotation in the opposite direction. 
Thus it is obtained from the original matrix by 
replacing the angle a by —a. The only change 
in the matrix is that the signs of the sines are 
changed. 

For example, the precession matrix is a prod¬ 
uct of three rotation matrices. Since the matrix 
product is not commutative, these rotations must 
be carried out in the correct order. 

A.6 Multiple Integrals 

An integral of a function / over a surface A 

1= f fdA 
J A 

can be evaluated as a double integral by express¬ 
ing the surface element dA in terms of coordinate 
differentials. In rectangular coordinates, 

dA = dr dy 

and in polar coordinates 

d A — r dr d< -p. 

The integration limits of the innermost integral 
may depend on the other integration variable. 
For example, the function xe y integrated over the 
shaded area is 


1=1 .re v dA = / / .re' d.r dy 

J A Jx = 0 J y = 0 



The surface need not be confined to a plane. 
For example, the area of a sphere is 



where the integration is extended over the sur¬ 
face S of the sphere. In this case the surface el¬ 
ement is 

dS = R 2 cos# d (p d 6, 
and the area is 

/»2 7 i n 7r/2 

A— I R 2 cos 6 d(pdd 

J cp = 0 Jo = — 7t/2 


= f 0 [l-l 2 /2^ 2sin0 ] d ^ 



Similarly, a volume integral 
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can be evaluated as a triple integral. In rectangu- A.7 Numerical Solution 

lar coordinates, the volume element dV is of an Equation 


dV = dx dvdz; 

in cylindrical coordinates 

d V = r dr d q> dz, 

and in spherical coordinates 

dV = r 2 cos 9 dr d<p d 6 

(6 measured from the xy plane) 


or 


d V = r 2 sinO dr d <p dd 

(6 measured from the z axis). 



For example, the volume of a sphere with ra¬ 
dius R is 



We frequently meet equations defying analyti¬ 
cal solutions. Kepler’s equation is a typical ex¬ 
ample. If we cannot do anything else, we can 
always apply some numerical method. Next we 
shall present two very simple methods, the first 
of which is particularly suitable for calculators. 

Method 1: Direct Iteration We shall write the 
equation as fix) — x. Next we have to find an 
initial value xo for the solution. This can be 
done, for example, graphically or by just guess¬ 
ing. Then we compute a succession of new it¬ 
erates x\ = fix o), X 2 — fix i), and so on, un¬ 
til the difference of successive solutions becomes 
smaller than some preset limit. The last iterate x, 
is the solution. After computing a few x\ ’s, it is 
easy to see if they are going to converge. If not, 
we rewrite the equation as / _ 1 (x) = x and try 
again. (/“* is the inverse function of /.) 

As an example, let us solve the equation x — 
— In x. We guess xo = 0.5 and find 

x\ = — In 0.5 = 0.69, X 2 = 0.37, * 3 = 1.00. 



This already shows that something is wrong. 
Therefore we change our equation to x — e _ ' and 
start again: 


x 0 = 0.5, 

xi = e -0 ' 5 = 0.61, 
x 2 = 0.55, 

X 3 = 0.58, 
x 4 = 0.56, 
x 5 =0.57, 
x 6 = 0.57. 
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Thus the solution, accurate to two decimal places, 
is 0.57. 

Method 2. Interval Halving In some patholog¬ 
ical cases the previous method may refuse to con¬ 
verge. In such situations we can use the foolproof 
method of interval halving. If the function is con¬ 
tinuous (as most functions of classical physics 
are) and we manage to find two points x\ and X 2 
such that f{x\) > 0 and fixf) < 0, we know 
that somewhere between x\ and X 2 there must be 
a point x in which fix) = 0. Now we find the 
sign of / in the midpoint of the interval, and se¬ 
lect the half of the interval in which / changes 
sign. We repeat this procedure until the interval 
containing the solution is small enough. 

We shall try also this method on our equation 
x = — lnx, which is now written as fix) — 0, 
where fix) — x + Inx. Because fix) —> — 00 , 
when x —> 0 and /(1) > 0, the solution must be 
in the range (0, 1). Since /(0.5) < 0, we know 


that x e (0.5, 1). We continue in this way: 


/(0.75) > 0 
/(0.625) > 0 
/(0.563) < 0 
/(0.594) > 0 


x e (0.5,0.75), 
x e (0.5,0.625), 
x e (0.563,0.625), 
x e (0.563,0.594). 


The convergence is slow but certain. Each iter¬ 
ation restricts the solution to an interval which is 
half as large as the previous one, thus improving 
the solution by one binary digit. 
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Albert Einstein published his special theory of 
relativity in 1905 and the general theory of rel¬ 
ativity ten years later. Especially the general the¬ 
ory, which is essentially a gravitation theory, has 
turned out to be very important for the theories of 
the evolution of the Universe. Therefore it is ap¬ 
propriate to consider here some basic principles 
of relativity theory. A more detailed discussion 
would require sophisticated mathematics, and is 
beyond the scope of this very elementary book. 


B.1 Basic Concepts 

Everyone knows the famous Pythagorean theo¬ 
rem 

As 2 = Ax 2 + Ay 2 , 

where As is the length of the hypotenuse of 
a right-angle triangle, and Ax and Ay are the 
lengths of the other two sides. (For brevity, we 
have denoted A .? 2 = (As) 2 .) This is easily gener¬ 
alised to three dimensions: 

As 2 = Ax 2 + Ay 2 + Az 2 ■ 

This equation describes the metric of an ordinary 
rectangular frame in a Euclidean space, i.e. tells 
us how to measure distances. 

Generally the expression for the distance be¬ 
tween two points depends on the exact location 
of the points. In such a case the metric must be 
expressed in terms of infinitesimal distances in 
order to correctly take into account the curva¬ 
ture of the coordinate curves. (A coordinate curve 


is a curve along which one coordinate changes 
while all the others remain constant.) An in¬ 
finitesimal distance d.v is called the line element. 
In a rectangular frame of a Euclidean space it is 

d .? 2 = dx 2 + dy 2 + dz 2 , 

and in spherical coordinates 

d .? 2 = dr 2 + r 2 (d 0 2 + cos 2 0 dtp 2 ). 

Generally d .? 2 can be expressed as 

d.v 2 = gij dx; cLr^-, 

•J 

where the Xj ’s are arbitrary coordinates, and the 
coefficients gjj are components of the metric ten¬ 
sor. These can be functions of the coordinates, 
as in the case of the spherical coordinates. The 
metric tensor of an n -dimensional space can be 
expressed as an n x n matrix. Since dx, dxj — 
dxjdxi, the metric tensor is symmetric, i.e. g,j = 
gji. If all coordinate curves intersect perpendic¬ 
ularly, the coordinate frame is orthogonal. In an 
orthogonal frame, gi j = 0 for all i f j. For exam¬ 
ple, the spherical coordinates form an orthogonal 
frame, the metric tensor of which is 

(10 0 \ 

(gij) = I 0 r 0 . 

\0 0 r 2 cos 2 9) 

If it is possible to find a frame in which all the 
components of g are constant, the space is flat. 
In the rectangular frame of a Euclidean space, 
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we have gn = gn = #33 = 1 ; hence the space 
is flat. The spherical coordinates show that even 
in a flat space, we can use frames in which the 
components of the metric tensor are not constant. 
The line element of a two-dimensional spherical 
surface is obtained from the metric of the spheri¬ 
cal coordinate frame by assigning to r some fixed 
value R : 

ds 2 = R 2 (d9 2 + cos 2 9 df 2 ). 


speed of light follows immediately from the ba¬ 
sic assumptions of the special theory of relativity. 
This is possible only if different observers mea¬ 
sure time intervals differently. 

Let us send a beam of light from the origin. It 
travels along a straight line at the speed c. Thus, 
at the moment t its space and time coordinates 
satisfy the equation 

x 2 y 2 + z 2 — c 2 t 2 . (B.l) 


The metric tensor is 

^S'j) ^0 cos 2 ' 

This cannot be transformed to a constant tensor. 
Thus the surface of a sphere is a cumed space. 

If we know the metric tensor in some frame, 
we can compute a fourth-order tensor, the curva¬ 
ture tensor Rijkl , which tells us whether the space 
is curved or flat. Unfortunately the calculations 
are slightly too laborious to be presented here. 

The metric tensor is needed for all computa¬ 
tions involving distances, magnitudes of vectors, 
areas, and so on. Also, to evaluate a scalar prod¬ 
uct, we must know the metric. In fact the com¬ 
ponents of the metric tensor can be expressed as 
scalar products of the basis vectors: 


Next we study what the situation looks like in an¬ 
other frame, x'y'z !, moving at a velocity v with 
respect to the xyz frame. Let us select the new 
frame so that it coincides with the xyz frame 
at t — 0. Also, let the time coordinate t' of the 
x'y'z' frame be t' = 0 when t — 0. And finally we 
assume that the x'y'z' frame moves in the direc¬ 
tion of the positive x axis. Since the beam of light 
must also travel at the speed c in the x'y'z' frame, 
we must have 


If we require that the new (dashed) coordinates 
be obtained from the old ones by a linear trans¬ 
formation, and also that the inverse transforma¬ 
tion be obtained simply by replacing v by —v, 
we find that the transformation must be 


8ij=ei ■ e j . 

If A and B are two arbitrary vectors 

A = ^a ! e/, B = ^ b 1 e,-, 

i i 

their scalar product is 

a ■ B = a ' bJ ^i- *j = X! sij a>bJ ■ 


* j 


* j 


B.2 Lorentz Transformation. 
Minkowski Space 

The special theory of relativity abandoned the ab¬ 
solute Newtonian time, supposed to flow at the 
same rate for every observer. Instead it required 
that the speed of light must have the same value c 
in all coordinate frames. The constancy of the 


, x — vt 
X — . =, 

yj 1 — V 2 /C 2 

/ = y, 

, (B.2) 

z =z, 

, t — vx/c 2 

y/ 1 — V 2 /c 2 

This transformation between frames moving at 
a constant speed with respect to each other is 
called the Lorentz transformation. 

Because the Lorentz transformation is derived 
assuming the invariance of (B.l), it is obvious 
that the interval 

As 2 = —c 2 At 2 + Ax 2 + Ay 2 + Az 2 


of any two events remains invariant in all Lorentz 
transformations. This interval defines a metric in 
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four-dimensional spacetime. A space having such 
a metric is called the Minkowski space or Lorentz 
space. The components of the metric tensor g are 


(-C 2 

0 

0 

0\ 

0 

1 

0 

0 

0 

0 

1 

0 

V o 

0 

0 

1/ 


Since this is constant, the space is flat. But it is 
no longer an ordinary Euclidean space, because 
the sign of the time component differs from the 
sign of the space components. In the older litera¬ 
ture, a variable ict is often used instead of time, 
i being the imaginary unit. Then the metric looks 
Euclidean, which is misleading; the properties of 
the space cannot be changed just by changing no¬ 
tation. 

In a Minkowski space position, velocity, mo¬ 
mentum and other vector quantities are described 
by four-vectors, which have one time and three 
space components. The components of a four- 
vector obey the Lorentz transformation when we 
transform them from one frame to another, both 
moving along a straight line at a constant speed. 

According to classical physics the distance of 
two events depends on the motion of the observer, 
but the time interval between the events is the 
same for all observers. The world of special rel¬ 
ativity is more anarchistic: even time intervals 
have different values for different observers. 


B.3 General Relativity 


physical reason to assume that the two masses 
should have anything in common. However, al¬ 
ready the experiments made by Galilei showed 
that evidently m\ — m g . This has been verified 
later with very high accuracy. 

The weak equivalence principle, which states 
that m\ — m g , can therefore be accepted as 
a physical axiom. The strong equivalence prin¬ 
ciple generalises this: if we restrict our observa¬ 
tions to a sufficiently small region of spacetime, 
there is no way to tell whether we are subject to 
a gravitational field or are in uniformly acceler¬ 
ated motion. The strong equivalence principle is 
one of the fundamental postulates of general rel¬ 
ativity. 

Curvature of Space General relativity descri¬ 
bes gravitation as a geometric property of the 
spacetime. The equivalence principle is an ob¬ 
vious consequence of this idea. Particles mov¬ 
ing through spacetime follow the shortest possi¬ 
ble paths, geodesics. The projection of a geodesic 
onto a three-dimensional space need not be the 
shortest way between the points. 

The geometry of spacetime is determined by 
the mass and energy distribution. If this distribu¬ 
tion is known, we can write the field equations, 
which are partial differential equations connect¬ 
ing the metric to the mass and energy distribution 
and describing the curvature of spacetime. 

In the case of a single-point mass, the field 
equations yield the Schwarzschild metric, the line 
element of which is 


The Equivalence Principle Newton’s laws re¬ 
late the acceleration, a, of a particle and the ap¬ 
plied force F by 


F — niia, 

where m\ is the inertial mass of the particle, re¬ 
sisting the force trying to move the particle. The 
gravitational force felt by the particle is 

F = nigf, 

where m g is the gravitational mass of the particle, 
and / is a factor depending only on other masses. 
The masses m\ and m g appear as coefficients re¬ 
lated to totally different phenomena. There is no 


d.v 2 



2 GM\ 
c 2 r J 


c 2 dr 2 ■ 


dr 2 

1 - 2 GM/c 2 r 


+ r 2 (&0 2 + cos 2 0 dfi 2 ). 


Here M is the mass of the point; r, 9 and (p are or¬ 
dinary spherical coordinates. It can be shown that 
the components cannot be transformed to con¬ 
stants simultaneously: spacetime must be curved 
due to the mass. 

If we study a very small region of spacetime, 
the curvature has little effect. Locally the space is 
always a Minkowski space where special relativ¬ 
ity can be used. Locality means not only a limited 
spatial volume but also a limited interval in time. 
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Unlike in the Newtonian formalism in general 
relativity there are no equations of motion de¬ 
scribing the motion of a particle. It is possible 
to use the positions of the bodies to compute the 
metric tensor of the space and then the geodesics 
representing the orbits of the bodies. When the 
bodies are moving, the metric of the space keeps 
changing, and thus this method is very laborious. 
The PPN (Parametrised Post-Newtonian) formal¬ 
ism has been developed for practical calculations, 
yielding approximate equations of motion. It con¬ 
tains ten constants, the values of which are spe¬ 
cific for Einstein’s theory of gravity. Alternative 
theories give different values for these constants. 
The formalism therefore provides a framework 
for experimental tests of general relativity. 

The PPN formalism is an approximation that 
can be used if velocities are well below the speed 
of light and the gravitational field is weak and 
hence the curvature of the space is low. 

B.4 Tests of General Relativity 

General relativity gives predictions different from 
classical physics. Although the differences are 
usually very small, there are some phenomena in 
which the deviation can be measured and used to 
test the validity of general relativity. At presents 
five different astronomical tests have verified the 
theory. 

First of all the orbit of a planet is no longer 
a closed Keplerian eclipse. The effect is strongest 
for the innermost planets, whose perihelia should 
turn little by little. Most of the motion of the peri¬ 
helion of Mercury is predicted by Newtonian me¬ 
chanics; only a small excess of 43 arc seconds 
per century remains unexplained. And it so hap¬ 
pens that this is exactly the correction suggested 
by general relativity. 

Secondly a beam of light should bend when 
it travels close to the Sun. For a beam grazing 


the surface, the deviation should be about 1.75". 
Such effects have been observed during total so¬ 
lar eclipses and also by observing pointlike radio 
sources, just before and after occultation by the 
Sun. 

The third classical way of testing general rela¬ 
tivity is to measure the redshift of a photon climb¬ 
ing in a gravitational field. We can understand 
the redshift as an energy loss while the photon 
does work against the gravitational potential. Or 
we can think of the redshift as caused by the 
metric only: a distant observer finds that near 
a mass, time runs slower, and the frequency of 
the radiation is lower. The time dilation in the 
Schwarzschild metric is described by the coeffi¬ 
cient of dr; thus it is not surprising that the radi¬ 
ation emitted at a frequency v from a distance r 
from a mass M has the frequency 



if observed very far from the source. This grav¬ 
itational redshift has been verified by laboratory 
experiments. 

The fourth test employs the slowing down of 
the speed of light in a gravitational field near the 
Sun. This has been verified by radar experiments. 

The previous tests concern the solar system. 
Outside the solar system binary pulsars have been 
used to test general relativity. An asymmetric sys¬ 
tem in accelerated motion (like a binary star) 
loses energy as it radiates gravitational waves. It 
follows that the components approach each other, 
and the period decreases. Usually the gravita¬ 
tional waves play a minor role, but in the case of 
a compact source the effect may be strong enough 
to be observed. The first source to show this pre¬ 
dicted shortening of period was the binary pulsar 
PSR 1913+16. 
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Table C.1 SI basic units 


Quantity 

Symbol 

Unit 

Abbr. 

Definition 

Length 

/, s,... 

Metre 

m 

The length of the path travelled by light in vacuum during a time 
interval of 1 /299,792,458 of a second. 

Mass 

m, M 

Kilogram 

kg 

Equal to the mass of the international prototype of the kilogram. 

Time 

t 

Second 

s 

The duration of 9,192,631,770 periods of the radiation 
corresponding to the transition between two hyperfine levels of 
the ground state of the caesium 133-atom. 

Electric current 

I 

Ampere 

A 

That constant current which, if maintained in two straight 
parallel conductors of infinite length, of negligible circular cross 
section and placed 1 metre apart in a vacuum, would produce a 
force equal to 2 x 10~ 7 Newton per metre of length between 
these conductors. 

Temperature 

T 

Kelvin 

K 

The fraction 1/273.16 of the thermodynamic temperature of the 
triple point of water. 

Amount of substance 

n 

Mole 

mol 

The amount of substance of a system which contains as many 
elementary entities as there, are atoms in 0.012 kg of 12 C. 

Luminous intensity 

i 

Candela 

cd 

The luminous intensity in a given direction of a source which 
emits monochromatic radiation of frequency 540 x 10 12 Hz and 
of which the radiant intensity in that direction is 1/683 Watt per 
steradian. 


Table C.2 Prefixes for orders of ten 


Prefix 

Symbol 

Multiple 

Prefix 

Symbol 

Multiple 

yocto 

y 

io - 24 

deca 

da 

10 1 

zepto 

Z 

io - 21 

hecto 

h 

10 2 

atto 

a 

10-18 

kilo 

k 

10 3 

femto 

f 

10- 15 

Mega 

M 

10 6 

pico 

p 

lO^ 12 

Giga 

G 

10 9 

nano 

n 

10“ 9 

Tera 

T 

10 12 

micro 

p 

10“ 6 

Peta 

P 

10 15 

milli 

m 

10- 3 

Exa 

E 

10 18 

centi 

c 

10- 2 

Zetta 

Z 

10 21 

deci 

d 

10- 1 

Yotta 

Y 

10 24 
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Table C.3 Constants and 
units 


Radian 

1 rad 

= 180°/jr = 57.2957795° = 206,264.8" 

Degree 

1° 

= 0.01745329 rad 

Arc second 

!" 

= 0.000004848 rad 


Velocity of light 

c 

= 299,792.458 ms -1 

Gravitational constant 

G 

= 6.673 x 10~ n m 3 kg -1 s -2 
= An 2 AU 3 Mg 1 a -2 
= 3,986,005 x 10 8 m 3 M^ 1 s“ 2 

Planck constant 

h 

= 6.6261 x 10“ 34 Js 


h 

t h/2n = 1.0546 x 10~ 34 

Boltzmann constant 

k 

= 1.3807 x 10“ 23 J K _1 

Radiation density constant 

a 

= 7.5659 x 10“ 16 Jm- 3 K- 4 

Stefan-Boltzmann constant 

o 

= ac/A = 5.6705 x 10“ 8 Wm _2 K~ 4 

Atomic mass unit 

amu 

= 1.6605 x 10- 27 kg 

Electron volt 

eV 

= 1.6022 x 10“ 19 J 

Electron charge 

e 

= 1.6022 x 10“ 19 C 

Mass of electron 

m e 

= 9.1094 x 10“ 31 kg = 0.511 MeV 

Mass of proton 

m p 

= 1.6726 x 10- 27 kg = 938.3 MeV 

Mass of neutron 

m n 

= 1.6749 x 10- 27 kg = 939.6 MeV 

Mass of 1 H atom 

mu 

= 1.6735 x 10- 27 kg = 1.0078 amu 

Mass of 4 He atom 

mue 

= 6.6465 x 10- 27 kg = 4.0026 amu 

Rydberg constant for 1 H 

Rn 

= 1.0968 x 10 7 m- 1 

Rydberg constant for oo mass 

Roo 

= 1.0974 x 10 7 m^ 1 

Gas constant 

R 

= 8.3145 JK^'mor 1 

Normal atmospheric pressure 

atm 

= 101,325 Pa = 1013 mbar = 760 mmHg 


Astronomical unit 

AU 

= 1.49597870 x 10 11 m 

Parsec 

pc 

= 3.0857 x 10 16 m = 206.265 AU = 3.26 ly 

Light-year 

iy 

= 0.9461 x 10 16 m = 0.3066 pc 
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Table C.4 


Table C.5 

alphabet 


Units of time 


Unit 

Equivalent to 

Sidereal year 

365.2564 d (with respect to fixed stars) 

Tropical year 

365.2422 d (equinox to equinox) 

Anomalistic year 

365.2596 d (perihelion to perihelion) 

Gregorian calendar year 

365.2425 d 

Julian year 

365.25 d 

Julian century 

36,525 d 

Eclipse year 

346.6200 d (with respect to the ascending node 
of the Moon) 

Lunar year 

354.367 d = 12 synodical months 


Synodical month 

29.5306 d (newmoon to newmoon) 

Sidereal month 

27.3217 d (with respect to fixed stars) 

Tropical month 

27.3216 d (with respect to the vernal equinox) 

Anomalistic month 

27.5546 d (perigee to perigee) 

Draconic month 

27.2122 d (node to node) 


Mean solar day 

24 h mean solar time 

= 24 h 03 min 56.56 s sidereal time 
= 1.00273791 sidereal days 

Sidereal day 

24 h sidereal time 

= 23 h 56 min 04.09 s mean solar time 
= 0.99726957 mean solar days 

Rotation period of the Earth 
(referred to fixed stars) 

1.000000097 sidereal days 
= 23 h 56 min 04.10 s mean solar time 


The Greek 


A, a B,i 8 r,y A ,8 E,e,s Z, f H, r) 0,9,$ 

alpha beta gamma delta epsilon zeta eta theta 


I, L 

K, K 

A, X 

M , /x 

N, v 


0,o 

17, 7r, m 

iota 

kappa 

lambda 

mu 

nu 

xi 

omicron 

Pi 


P,P 

U, a, g 

T, T 

T, v 

0, tp, ip 

*,X 


£2, co 

rho 

sigma 

tau 

upsilon 

phi 

chi 

psi 

omega 
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Table C.6 The Sun 


Property 

Symbol 

Numerical value 

Mass 

M 0 _ 

1.989 x 10 30 kg 

Radius 

Rq 

6.960 x 10 8 m = 0.00465 AU 

Effective temperature 

T e 

5785 K 

Luminosity 

L© 

3.9 x 10 26 W 

Apparent visual magnitude 

V 

-26.78 

Colour indices 

B-V 

0.62 


U-B 

0.10 

Absolute visual magnitude 

M v 

4.79 

Absolute bolometric magnitude 

M\)o\ 

4.72 

Inclination of equator to ecliptic 


7° 15' 

Equatorial horizontal parallax 

n O _ 

8.794" 

Motion: direction of apex 


a = 270° 



8 = 30° 

velocity in LSR 


19.7 kms -1 

Distance from galactic centre 


8.5 kpc 


Table C.7 The Earth 


Property 

Symbol 

Numerical value 

Mass 

M® 

t M®/332,946 = 5.974 x 10 24 kg 

Mass, Earth + Moon 

M® + M(j 

t M®/328,900.5 = 6.048 x 10 24 kg 

Equatorial radius 

Re 

16,378,137 m 

Polar radius 

R P 

16,356,752 m 

Flattening 

f 

= (R e - R p )/R e = 1/298.257 

Surface gravity 

g 

= 9.81 ms -2 


Table C.8 The Moon 


Property 

Symbol 

Numerical value 

Mass 


P M®/81.30 = 7.348 x 10 22 kg 

Radius 

% 

p 1738 km 

Surface gravity 

g« 

p 1.62 ms -2 = 0.17g 

Mean equatorial horizontal parallax 

n <L 

p 57' 

Semimajor axis of the orbit 

a 

p384,400 km 

Smallest distance from Earth 

r min 

p356,400 km 

Greatest distance from Earth 

r max 

= 406, 700 km 

Mean inclination of orbit to ecliptic 

L 

= 5.145° 
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Table C.9 Planets. R e = equatorial radius, p = mean 
density, r s ;d = sidereal rotation period (R indicates ret¬ 
rograde rotation), e = inclination of the equator with 
respect to the ecliptic (at the beginning of 2000), 


/ = flattening, T = surface temperature, p = geometric 
albedo, Vo = mean opposition magnitude. The data are 
based mostly on the Astronomical Almanac 


Name 

Re 

[km] 

Mass 

P 

[g/cm 3 ] 

Number of 

known 

satellites 

Planet 

Planet + Satellites 



[kg] 

[Mel 

[M q ] 


Mercury 

2440 

3.30 x 10 23 

0.0553 

1/6,023,600 

5.4 

- 

Venus 

6052 

4.87 x 10 24 

0.8150 

1/408,523.5 

5.2 

- 

Earth 

6378 

5.97 x 10 24 

1.0123 

1/328,900.5 

5.5 

1 

Mars 

3397 

6.42 x 10 23 

0.1074 

1/3,098,710 

3.9 

2 

Jupiter 

71,492 

1.90 x 10 27 

317.89 

1/1047.355 

1.3 

67 

Saturn 

60,268 

5.69 x 10 26 

95.17 

1/3498.5 

0.7 

62 

Uranus 

25,559 

8.66 x 10 25 

14.56 

1/22,869 

1.3 

27 

Neptune 

24,764 

1.03 x 10 26 

17.24 

1/19,314 

1.8 

14 


Name 

Uid 

e 

n 

/ 

T 

[K] 

P 

Vo 

Mercury 

58.646 d 

0.0 

0 

130-615 

0.106 

- 

Venus 

243.019 dR 

177.4 

0 

750 

0.65 

- 

Earth 

23 h 56 min 04.1 s 

23.4 

0.003353 

300 

0.367 

- 

Mars 

24 h 37 min 22.6 s 

25.2 

0.006476 

220 

0.150 

-2.01 

Jupiter 

9 h 55 min 30 s 

3.1 

0.06487 

140 

0.52 

-2.70 

Saturn 

10 h 39 min 22 s 

26.7 

0.09796 

100 

0.47 

+0.67 

Uranus 

17 h 14 min 24 s R 

97.8 

0.02293 

65 

0.51 

+5.52 

Neptune 

16 h 06 min 36 s 

28.3 

0.01708 

55 

0.41 

+7.84 


Table C.10 Magnitudes of the planets. The table gives 
the expressions used in the Astronomical Almanac. They 
give the magnitudes as functions of the phase angle a . In¬ 
terior planets can be observed at relatively large phase an¬ 
gles, and thus several terms are needed to describe their 
phase curves. Phase angles of the exterior planets are al¬ 
ways small, and thus very simple expressions are suffi¬ 
cient. The magnitude of Saturn means the magnitude of 
the planet only; the total magnitude also depends on the 
orientation of the rings 



V(1,0) 

V(l, a) 

Mercury 

-0.36 

V(1,0) + 3.80(q!/ 100 o ) 

- 2.73(a/100°) 2 + 2.00(o-/100°) 3 

Venus 

-4.29 

V(l, 0) + 0.09(a-/100°) 

+ 2.39(a/100°) 2 - 0.65(0-/100°) 3 

Mars 

-1.52 

V(l, 0) + 1.60(a/100°) 

Jupiter 

-9.25 

V(1,0) +0.5(a/100 o ) 

Saturn 

-8.88 


Uranus 

-7.19 

V(l, 0) + 0.28(a/100°) 

Neptune 

-6.87 

V(1,0) 
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Table C.11 Osculating elements of planetary orbits ing node, m = longitude of perihelion, L = men longi- 
on JD 2,451,600.5 (Feb 24, 2000). a = semimajor axis, tude, Aid = mean sidereal orbital period (here 1 a means 
e = eccentricity, i = inclination, Q = longitude of ascend- a Julian year, or 365.25 days), P syn = mean synodic period 

a e 

i Q m L 

^sid ^syn 

[AU] [10 6 km] 

[°] [°] [°] [°] 

[a] [~ [d] p 

Mercury 0.387 57.9 0.2056 

7.01 48.3 77.5 119.4 

0.2408 87.97 115.9 

Venus 0.723 108.2 0.0068 

3.39 76.7 131.9 270.9 

0.6152 224.7 583.9 

Earth 1.000 149.6 0.0167 

0.00 143.9 102.9 155.2 

1.0000 | 365.3 - 

Mars 1.524 228.0 0.0934 

1.85 49.6 336.1 24.5 

1.8807 686.9 779.9 

Jupiter 5.204 778.6 0.0488 

1.30 100.5 15.5 39.0 

11.8565 4330.6 398.9 

Saturn 9.582 1433.4 0.0558 

2.49 113.6 89.9 51.9 

29.4235 10,747 378.1 

Uranus 19.224 2875.8 0.0447 

0.77 74.0 170.3 314.1 

83.747 30,589 369.7 

Neptune 30.092 4501.7 0.0112 

1.77 131.8 39.5 305.5 

163.723 59,800 367.5 


elements of planets with Mercury 

a = 0.38709893 + 0.000000667 

e = 0.20563069 + 0.000025277 

respect to the equator and 

i = 7.00487° -23.51"7 

Q = 48.33167° — 446.30"7 

equinox of J2000.0. The 

m = 77.45645° + 573.57"7 

L = 252.25084° + 4.09233880°f 

days since J2000.0 and Venus 

a =0.72333199 + 0.000000927’ 

! e = 0.00677323 - 0.000049387 

7 is the same time in 

i = 3.39471° -2.86"7 

Q =76.68069° -996.89" 7 

Julian centuries: 

m = 131.53298° - 108.80"7 

L = 181.97973° + 1.60213047°? 

7 = f/36.525. L is the Earth + Moon 

a = 1.00000011 - 0.00000005 r 

e = 0.01671022 - 0.000038047 

mean longitude. 

i =0.00005° -46.94" T 

Q = —11.26064° - 18,228.25"7 

L = M + n t. The elements 

m = 102.94719° + 1198.28"7 

L = 100.46435° + 0.98560910°? 

Q t'C 

terms, and the accuracy of 

a = 1.52366231 - 0.0000722 IT 

[7Z 0.09341233 + 0.000119027 

the positions computed 

( = 1.85061° -25.47"7 

J2 = 49.57854° - 1020.19"7 

from them is of the order 

m = 336.04084° + 1560.78" T 

L = 355.45332° + 0.52403304°? 

The values are from the Jupiter 

a = 5.20336301 + 0.000607377 

e = 0.04839266 - 0.000128807 

Explanatory Supplement to 

i = 1.30530° -4.15"7 

Q = 100.55615° + 1217.17"7 

the Astronomical Almanac. 

m= 14.75385° + 839.93"7 

L = 34.40438° + 0.08308676°? 

describe the orbit of the Saturn 

a = 9.53707032 - 0.003015307 

e = 0.05415060 - 0.000367627 

barycentre of the 

(=2.48446° + 6.11" 7 

Q = 113.71504° - 1591.05"7 

Earth-Moon system 

m = 92.43194° - 1948.89"7 

L = 49.94432° + 0.03346063°? 

Uranus 

a = 19.19126393 + 0.001520257 

e = 0.04716771 - 0.000191507 


( =0.76986° -2.09" 7 

Q =74.22988° + 1681.40" 7 


m = 170.96424° + 1312.56" 7 

L = 313.23218° + 0.01173129°? 

Neptune 

a = 30.06896348 - 0.001251967 

e = 0.00858587 + 0.000025147 


( = 1.76917° -3.64"7 

Q = 131.72169° - 151.25"7 


m = 44.97135° -844.43" 7 

L = 304.88003° + 0.00598106°? 




Table C.13 Largest satellites of the planets. The giant planets have a large number of satellites, and new ones are found frequently. Also the distinction between a large ring 
particle and a small satellite is somewhat arbitrary. Therefore this table is not complete, a = semimajor axis, P^a = sidereal period (tropical for Saturn’s moons), R means that the 
motion is retrograde, e = eccentricity, i = inclination with respect to the equator (E = ecliptic), r = radius, M = mass in planetary masses, p = density (calculated), p = geometric 
albedo and Vq mean opposition magnitude 
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Table C.14 Some well-known asteroids, a = semimajor axis, e = eccentricity, i = inclination, d = diameter, 
r s id = sidereal period of rotation, p = geometric albedo, Vo = mean opposition magnitude 



Asteroidi 

Discoverer 

Year of 
discovery 

a 

[AU] 

e 

i 

n 

d 

[km] 

7 sid 

[h] 

P 

Vo 

Type 

1 

Ceres 

Piazzi 

1801 

2.77 

0.08 

10.6 

946 

9.08 

0.07 

7.9 

C 

2 

Pallas 

Olbers 

1802 

2.77 

0.23 

34.8 

583 

7.88 

0.09 

8.5 

u 

3 

Juno 

Harding 

1804 

2.67 

0.26 

13.0 

249 

7.21 

0.16 

9.8 

s 

4 

Vesta 

Olbers 

1807 

2.36 

0.09 

7.1 

555 

5.34 

0.26 

6.8 

u 

5 

Astraea 

Hencke 

1845 

2.58 

0.19 

5.3 

116 

16.81 

0.13 

11.2 

s 

6 

Hebe 

Hencke 

1847 

2.42 

0.20 

14.8 

206 

7.27 

0.16 

9.7 

s 

7 

Iris 

Hind 

1847 

2.39 

0.23 

5.5 

222 

7.14 

0.20 

9.4 

s 

8 

Flora 

Hind 

1847 

2.20 

0.16 

5.9 

160 

13.60 

0.13 

9.8 

s 

9 

Metis 

Graham 

1848 

2.39 

0.12 

5.6 

168 

5.06 

0.12 

10.4 

s 

10 

Hygiea 

DeGasparis 

1849 

3.14 

0.12 

3.8 

443 

18.00 

0.05 

10.6 

c 

243 

Ida 

Palisa 

1884 

2.86 

0.04 

U 

32 

4.63 

0.16 



433 

Eros 

Witt 

1898 

1.46 

0.22 

10.8 

20 

5.27 

0.18 

11.5 

s 

588 

Achilles 

Wolf 

1906 

5.18 

0.15 

10.3 

70 

? 

r? 

16.4 

u 

624 

Hektor 

Kopff 

1907 

5.16 

0.03 

18.3 

230 

6.92 

0.03 

15.3 

u 

944 

Hidalgo 

Baade 

1920 

5.85 

0.66 

42.4 

30 

10.06 

r? 

19.2 

MEU 

951 

Gaspra 

Neujmin 

1916 

2.21 

0.17 

4.1 

19 

7.04 

0.15 



1221 

Amor 

Delporte 

1932 

1.92 

0.43 

11.9 

5 

? 

r? 

20.4 

? 

1566 

Icarus 

Baade 

1949 

1.08 

0.83 

22.9 

2 

2.27 

r? 

12.3 

u 

1862 

Apollo 

Reinmuth 

1932 

1.47 

0.56 

6.4 

2 

? 

r? 

16.3 

? 

2060 

Chiron 

Kowal 

1977 

13.64 

0.38 

6.9 

320 

? 

r? 

17.3 

r? 

5145 

Pholus 

Rabinowitz 

1992 

20.46 

0.58 

24.7 

190 






Table C.15 Principal meteor showers 


Shower 

Period of 

Maximum 

Radiant 

Meteors 

Comet 


visibility 


a 

<5 

per hour 


Quadrantids 

Jan. 1-5 

Jan. 3^1 

15.5 h 

+50° 

30-40 


Lyrids 

Apr. 19-25 

Apr. 22 

18.2 h 

+34° 

10 

Thatcher 

Tj Aquarids 

May 1-12 

May 5 

22.4 h 

-1° 

5-10 

Halley 

Perseids 

Jul. 20-Aug. 18 

Aug. 12 

3.1 h 

+58° 

40-50 

Swift-Tuttle 

K Cygnids 

Aug. 17-24 

Aug. 20 

19.1 h 

+59° 

5 


Orionids 

Oct. 17-26 

Oct. 21 

6.3 h 

+ 16° 

10-15 

Halley 

Taurids 

Oct. 10-Dec. 5 

Nov. 1 

3.8 h 

+ 14°,+22° 

5 

Encke 

Leonids 

Nov. 14-20 

Nov. 17 

10.2 h 

+22° 

10 

Tempel-Tuttle 

Geminids 

Dec. 7-15 

Dec. 13-14 

7.5 h 

+33° 

40-50 


Ursids 

Dec. 17-24 

Dec. 22 

13.5 h 

+78° 

5 
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Table C.16 Periodic comets with several perihelion 
passages observed. N = number of passages observed, 
T = time of perihelion passage, P = sidereal period, 
q = perihelion distance, e = eccentricity, w = argument 
of perihelion (1950.0), 12 = longitude of ascending node 
(1950.0), i = inclination, / = longitude of perihelion, de¬ 


fined here as l = Q + arctan(tann;cosi), b = latitude of 
perihelion (sin b = sin co sin i ), Q = aphelion distance. The 
elements are affected by planetary perturbations as well 
as reaction forces due to evaporating material the amount 
of which is difficult to predict 


Comet 

N 

r 

P 

[a] 

7 

[AU] 

e 

CO 

n 

Q 

n 

i 

n 

/ 

n 

b 

n 

Q 

[AU] 

Encke 

56 

Feb. 9, 1994 

3.28 

0.331 

0.850 

186.3 

334.0 

11.9 

160.2 

-1.3 

4.09 

Grigg-Skjellerup 

16 

JuL 24, 1992 

5.10 

0.995 

0.664 

359.3 

212.6 

21.1 

212.0 

-0.3 

4.93 

Honda-Mrkos-Pajdusakova 

8 

Sep. 12, 1990 

5.30 

0.541 

0.822 

325.8 

88.6 

4.2 

54.5 

-2.4 

5.54 

Tuttle-Giacobini-Kresak 

8 

Feb. 8, 1990 

5.46 

1.068 

0.656 

61.6 

140.9 

9.2 

202.1 

8.1 

5.14 

Tempel 2 

19 

Mar. 16. 1994 

5.48 

1.484 

0.522 

194.9 

117.6 

12.0 

312.1 

-3.1 

4.73 

Wirtanen 

8 

Sep. 21. 1991 

5.50 

1.083 

0.652 

356.2 

81.6 

11.7 

77.9 

-0.8 

5.15 

Clark 

8 

Nov. 28, 1989 

5.51 

1.556 

0.501 

208.9 

59.1 

9.5 

267.7 

-4.6 

4.68 

Forbes 

8 

Mar. 15, 1993 

6.14 

1.450 

0.568 

310.6 

333.6 

7.2 

284.5 

-5.4 

5.25 

Pons-Winnecke 

20 

Aug. 19. 1989 

6.38 

1.261 

0.634 

172.3 

92.8 

22.3 

265.6 

2.9 

5.62 

d" Arrest 

15 

Feb. 4. 1989 

6.39 

1.292 

0.625 

177.1 

138.8 

19.4 

316.0 

1.0 

5.59 

Schwassmann-Wachmann 2 

11 

Jan. 24, 1994 

6.39 

2.070 

0.399 

358.2 

125.6 

3.8 

123.8 

-0.1 

4.82 

Wolf-Harrington 

8 

Apr. 4, 1991 

6.51 

1.608 

0.539 

187.0 

254.2 

18.5 

80.8 

-2.2 

5.37 

Ciacobini-Zinner 

12 

Apr. 14. 1992 

6.61 

1.034 

0.707 

172.5 

194.7 

31.8 

8.3 

3.9 

6.01 

Reinmuth 2 

8 

Jun. 29, 1994 

6.64 

1.893 

0.464 

45.9 

295.4 

7.0 

341.1 

5.0 

5.17 

Perrine-Mrkos 

8 

Mar. 1, 1989 

6.78 

1.298 

0.638 

166.6 

239.4 

17.8 

46.6 

4.1 

5.87 

Arend-Rigaux 

7 

Oct. 3, 1991 

6.82 

1.438 

0.600 

329.1 

121.4 

17.9 

91.7 

-9.1 

5.75 

Borrelly 

11 

Dec. 18, 1987 

6.86 

1.357 

0.624 

353.3 

74.8 

30.3 

69.0 

-3.4 

5.86 

Brooks 2 

14 

Sep. 1, 1994 

6.89 

1.843 

0.491 

198.0 

176.2 

5.5 

14.1 

-1.7 

5.40 

Finlay 

11 

Jun. 5, 1988 

6.95 

1.094 

0.700 

322.2 

41.7 

3.6 

4.0 

-2.2 

6.19 

Johnson 

7 

Nov. 19, 1990 

6.97 

2.313 

0.366 

208.3 

116.7 

13.7 

324.3 

-6.4 

4.98 

Daniel 

8 

Aug. 31. 1992 

7.06 

1.650 

0.552 

11.0 

68.4 

20.1 

78.7 

3.8 

5.71 

Holmes 

8 

Apr. 10. 1993 

7.09 

2.177 

0.410 

23.2 

327.3 

19.2 

349.4 

7.4 

5.21 

Reinmuth 1 

8 

May 10, 1988 

7.29 

1.869 

0.503 

13.0 

119.2 

8.1 

132.0 

1.8 

5.65 

Faye 

19 

Nov. 15. 1991 

7.34 

1.593 

0.578 

204.0 

198.9 

9.1 

42.6 

-3.7 

5.96 

Ashbrook-Jackson 

7 

JuL 13, 1993 

7.49 

2.316 

0.395 

348.7 

2.0 

12.5 

350.9 

-2.4 

5.34 

Schaumasse 

10 

Mar. 5, 1993 

8.22 

1.202 

0.705 

57.5 

80.4 

11.8 

137.3 

10.0 

6.94 

Wolf 

14 

Aug. 28. 1992 

8.25 

2.428 

0.406 

162.3 

203.4 

27.5 

7.6 

8.1 

5.74 

Whipple 

9 

Dec. 22, 1994 

8.53 

3.094 

0.259 

201.9 

181.8 

9.9 

23.4 

-3.7 

5.25 

Comas Sola 

8 

Aug. 18. 1987 

8.78 

1.830 

0.570 

45.5 

60.4 

13.0 

105.2 

9.2 

6.68 

Vaisala 1 

6 

Apr. 29, 1993 

10.8 

1.783 

0.635 

47.4 

134.4 

11.6 

181.2 

8.5 

7.98 

Tuttle 

11 

Jun. 27, 1994 

13.5 

0.998 

0.824 

206.7 

269.8 

54.7 

106.1 

-21.5 

10.3 

Halley 

30 

Feb. 9, 1986 

76.0 

0.587 

0.967 

111.8 

58.1 

162.2 

305.3 

16.4 

35.3 
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Table C.17 Nearest stars. V = apparent visual magnitude, B — V = colour index, r = distance, /r = proper motion, 
v r = radial velocity (positive for receding objects) 


Name 

“2000 

M 

[mini 

^2000 

[°] 

n 

V 

B-V 

Spectrum 

r 

[pc] 

B 

[>] 

Vx 

[km/s] 

Sun 





-26.8 

0.6 

G2V 




a Cen C (Proxima) 

14 

29.7 

-62 

41 

11.0 

2.0 

M5eV 

1.30 

3.9 

-16 

a Cen A 

14 

39.6 

-60 

50 

-0.0 

0.7 

G2V 

1.33 

3.7 

-22 

a CenB 

14 

39.6 

-60 

50 

1.3 

0.9 

K1V 

1.33 

3.7 

-22 

Barnard's star 

17 

57.8 

4 

42 

9.5 

1.7 

M5V 

1.83 

10.3 

-108 

Wolf 359 

10 

56.5 

7 

01 

13.5 

2.0 

M6eV 

2.39 

4.7 

+ 13 

BD+36°2147 

11 

03.3 

35 

58 

7.5 

1.5 

M2V 

2.54 

4.8 

-86 

a CMa (Sirius) A 

6 

45.1 

-16 

43 

-1.5 

0.0 

A1V 

2.66 

1.3 

-8 

a CMa (Sirius) B 

6 

45.1 

-16 

43 

8.4 


wdA 

2.66 

1.3 

-8 

Luyten 726-8 A 

1 

39.0 

-17 

57 

12.5 


M6eV 

2.66 

3.3 

+29 

Luyten 726-8 B (UV Cet) 

1 

39.0 

-17 

57 

13.0 


M6eV 

2.66 

3.3 

+32 

Ross 154 

18 

49.8 

-23 

50 

10.4 


M4eV 

2.92 

0.7 

-4 

Ross 248 

23 

41.9 

44 

11 

12.2 

1.9 

M5eV 

3.13 

1.6 

-81 

s Eri 

3 

32.9 

-9 

27 

3.7 

0.9 

K2V 

3.26 

1.0 

+ 16 

Ross 128 

11 

47.7 

0 

48 

LL1 

1.8 

M5V 

3.31 

1.4 

-13 

Luyten 789-6 A 

22 

38.6 

-15 

17 

12.8 

2.0 

M5eV 

3.40 

3.3 

-60 

Luyten 789-6 B 

22 

38.6 

-15 

17 

13.3 



3.40 

3.3 

-60 

BD+43°44 A 

0 

18.4 

44 

01 

8.1 

1.6 

M3V 

3.44 

2.9 

+ 13 

BD+43°44B 

0 

18.4 

44 

01 

11.0 

1.8 

M6V 

3.44 

2.9 

+20 

e Ind 

22 

03.4 

-56 

47 

4.7 

1.1 

K5V 

3.45 

4.7 

-40 

BD+59°1915 A 

18 

42.8 

59 

38 

8.9 

1.5 

M4V 

3.45 

2.3 

0 

BD+59°1915B 

18 

42.8 

59 

38 

9.7 

1.6 

M4V 

3.45 

2.3 

+ 10 

61 CygA 

21 

06.9 

38 

45 

5.2 

1.2 

K5V 

3.46 

5.2 

-64 

61 CygB 

21 

06.9 

38 

45 

6.0 

1.4 

K7V 

3.46 

5.2 

-64 

t Cet 

1 

43.1 

-15 

56 

3.5 

0.7 

G8V 

3.48 

1.9 

-16 

CD—36°15693 

23 

05.9 

-35 

51 

7.4 

1.5 

M2V 

3.51 

6.9 

+ 10 

a CMi (Procyon) A 

7 

39.3 

5 

14 

0.4 

0.4 

F5IV 

3.51 

1.3 

-3 

a CMi (Procyon) B 

7 

39.3 

5 

14 

10.7 


wdF 

3.51 

1.3 


G51-15 

8 

29.8 

26 

47 

14.8 


M7V 

3.62 

1.3 


BD+5°1668 

7 

27.4 

5 

13 

9.8 

1.6 

M4V 

3.76 

3.8 

+26 

Luyten 725-32 

1 

12.6 

-17 

00 

11.8 


M6eV 

3.77 

1.4 


Kapteyn's star 

5 

11.7 

-45 

01 

OO 

OO 

1.6 

M1VI 

3.85 

OO 

oo 

+245 

CD—39°14192 

21 

17.2 

-38 

52 

6.7 

1.4 

MOeV 

3.85 

3.5 

+21 

Kruger 60 A 

22 

28.0 

57 

42 

9.9 

1.6 

M3V 

3.95 

0.9 

-26 

Kruger 60 B 

22 

28.0 

57 

42 

11.5 

1.8 

M4eV 

3.95 

0.9 

-26 

Ross 614 A 

6 

29.4 

-2 

49 

11.2 

1.7 

M4eV 

4.13 

1.0 

+24 

Ross 614 B 

6 

29.4 

-2 

49 

14.8 



4.13 

1.0 

+24 

BD—12°4523 

16 

30.3 

-12 

40 

10.2 

1.6 

M5V 

4.15 

1.2 

-13 

Wolf 424 A 

12 

33.3 

9 

01 

13.2 

1.8 

M6V 

4.29 

1.8 

-5 

Wolf 424 B 

12 

33.3 

9 

01 

13.2 



4.29 

1.8 

-5 
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Table C.17 (Continued) 


Name 

“2000 

M 

[min] 

■52000 

n 

n 

V 

B-V 

Spectrum 

r 

[pc] 

B 

["/a] 

v r 

[km/s] 

van Maanen’s star 

0 

49.2 

5 

23 

12.4 

0.6 

wdG 

4.33 

3.0 

+54 

Luyten 1159-16 

2 

00.2 

13 

03 

12.2 


M5eV 

4.48 

2.1 


CD—37°15492 

0 

05.4 

-37 

21 

8.6 

1.5 

M3V 

4.48 

6.1 

+23 

Luyten 143-23 

10 

44.5 

-61 

12 

13.9 


dM 

4.48 

1.7 


CD—46°11540 

17 

28.7 

-46 

54 

9.4 

1.5 

M3 

4.52 

1.1 


LP731-58 

10 

48.2 

-11 

20 

15.6 


M7V 

4.55 

1.6 


Luyten 145-141 

11 

45.7 

-64 

50 

11.4 

0.2 

wdA 

4.57 

2.7 


BD+68°946 

17 

36.4 

68 

20 

9.1 

1.5 

M3V 

4.63 

1.3 

-22 

CD—49°13515 

21 

33.6 

-49 

01 

8.7 

1.5 

M2V 

4.63 

0.8 

+8 

BD+50°1725 

10 

11.3 

49 

27 

6.6 

1.4 

K2V 

4.67 

1.5 

-26 

G 158-27 

0 

06.7 

-07 

32 

13.7 


M5V 

4.67 

2.1 


BD—15°6290 

22 

53.3 

-14 

18 

10.2 

1.6 

M4V 

4.69 

1.1 

+9 

CD—44°11909 

17 

37.1 

-44 

19 

11.0 


M5V 

4.72 

1.1 


G 208^14/45 A 

19 

53.9 

44 

25 

13.4 


M6eV 

4.72 

0.7 


G 208^14/45 B 

19 

53.9 

44 

25 

14.3 


dM 

4.72 

0.7 


G 208^14/45 C 

19 

53.9 

44 

25 

15.5 


dM 

4.72 

0.7 


o 2 Eri A 

4 

15.3 

-7 

39 

4.4 

0.8 

K0V 

4.76 

4.0 

-43 

o 2 Eri B 

4 

15.3 

-7 

39 

9.5 

0.0 

wdA 

4.76 

4.0 

-21 

o 2 Eri C 

4 

15.3 

-7 

39 

11.2 

1.7 

M4eV 

4.76 

4.0 

-45 

BD+20°2465 

10 

19.6 

19 

52 

9.4 

1.5 

M4V 

4.88 

0.5 

+ 11 

70 Oph A 

18 

05.5 

2 

30 

4.2 

0.9 

K0V 

4.98 

1.1 

-7 

70 OphB 

18 

05.5 

2 

30 

6.0 


K5V 

4.98 

1.1 

-10 

BD+44°2051 A 

11 

05.5 

43 

32 

8.7 


M2V 

5.00 

4.5 

+65 

BD+44°2051B 

11 

05.5 

43 

32 

14.4 


M5eV 

5.00 

4.5 

+65 

a Aql (Altair) 

19 

50.8 

8 

52 

0.8 

0.2 

A7V 

5.08 

0.7 

-26 


Table C.18 Brightest stars (V < 2). V = apparent visual magnitude, B — V = colour index, r = distance. Remarks: 
b = binary, sb = spectroscopic binary, v = variable 


Name 


“2000 

[h] 

[min] 

■52000 

n 

n 

V 

B-V 

Spectrum 

r 

[pc] 

Remarks 

a CMa 

Sirius 

6 

45.2 

-16 

43 

-1.5 

0.0 

AIV, wdA 

2.7 

b 

a Car 

Canopus 

6 

24.0 

-52 

42 

-0.7 

0.2 

A91I 

60 


a Cen 

Rigil 

14 

39.6 

-60 

50 

-0.3 

0.7 

G2V,K1V 

1.3 

b, Proxima 


Kentaurus 









2.2° apart 

a Boo 

Arcturus 

14 

15.7 

19 

11 

-0.0 

1.2 

K2IIIp 

11 


a Lyr 

Vega 

18 

36.9 

38 

47 

0.0 

0.0 

A0V 

8 


a Aur 

Capella 

5 

16.7 

46 

00 

0.1 

0.8 

G2III,G6III 

14 

b 

/I Ori 

Rigel 

5 

14.5 

-8 

12 

0.1 

-0.0 

BSIa 

90 

b 

a CMi 

Procyon 

7 

39.3 

5 

14 

0.4 

0.4 

F5IV,wdF 

3.5 

b 

a Eri 

Achernar 

1 

37.7 

-57 

14 

0.5 

-0.2 

B3Vp 

40 
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Table C.18 (Continued) 


Name 


“2000 

[h] 

[min] 

£>000 

[°] 

n 

V 

B-V 

Spectrum 

r 

[pci 

Remarks 

a Ori 

Betelgeuze 

5 

55.2 

7 

24 

0.5 

1.9 

M2I 

200 

v 0.4 — 1.3, sb 

/3Cen 

Hadar 

14 

03.8 

-60 

22 

0.6 

-0.2 

B1III 

60 

b 

a Aql 

Altair 

19 

50.8 

8 

52 

0.8 

0.2 

A7V 

5.1 


a Cm 

Acrux 

12 

26.6 

-63 

06 

0.8 

-0.3 

B0.5IV.B1V 

120 

b 1.6 + 2.1 

a Tau 

Aldebaran 

4 

35.9 

16 

31 

0.9 

1.5 

K5III 

20 

b, v 

a Vir 

Spica 

13 

25.2 

-11 

10 

1.0 

-0.2 

B1IV 

50 

sb, several comp. 

a Sco 

Antares 

16 

29.4 

-26 

26 

1.0 

1.8 

M1.5LB2.5V 

50 

v 0.9- 1.8 

P Gem 

Pollux 

7 

45.3 

28 

02 

1.2 

1.1 

K0III 

11 


a PsA 

Fomalhaut 

22 

57.6 

-29 

37 

1.2 

0.1 

A3V 

7.0 


“ Cyg 

Deneb 

20 

41.4 

45 

17 

1.3 

0.1 

A2Ia 

500 


ft Cm 

Mimosa 

12 

47.7 

-59 

41 

1.3 

-0.2 

B0.5III 

150 

v, sb 

a Leo 

Regulus 

10 

08.4 

11 

58 

1.4 

-0.1 

B7V 

26 

b 

eCMa 

Adhara 

6 

58.6 

-28 

58 

1.5 

-0.2 

B2I1 

170 

b 

a Gem 

Castor 

7 

34.6 

31 

53 

1.6 

0.0 

A1V.A2V 

14 

b 

y Cru 

Gacrux 

12 

31.2 

-57 

07 

1.6 

1.6 

M3.5III 

40 

V 

X Sco 

Shaula 

17 

33.6 

-37 

06 

1.6 

-0.2 

B1.5IV 


V 

y Ori 

Bellatrix 

5 

25.1 

6 

21 

1.6 

-0.2 

B2III 

40 


P Tau 

Elnath 

5 

26.3 

28 

36 

1.7 

-0.1 

B7III 

55 


pCav 

Miaplacidus 

9 

13.2 

-69 

43 

1.7 

0.0 

A1III 

30 


e Ori 

Alnilam 

5 

36.2 

-1 

12 

1.7 

-0.2 

BOIa 



a Gm 

A1 Na’ir 

22 

08.2 

-46 

58 

1.7 

-0.1 

B7IV 

20 


eUMa 

Alioth 

12 

54.0 

55 

58 

1.8 

-0.0 

AOIVp 

120 

V 

y Vel 

Regor 

8 

09.5 

-47 

20 

1.8 

-0.2 

WC8.B1IV 


b 1.8 + 4.3, each sb 

a Per 

Mirfak 

3 

24.3 

49 

52 

1.8 

0.5 

F5Ib 

35 


a UMa 

Dubhe 

11 

03.7 

61 

45 

1.8 

1.1 

K0III 

30 

b 

eSgr 

Kaus Australis 

18 

24.2 

-34 

23 

1.9 

-0.0 

A01I 

70 


.5 CMa 

Wezen 

7 

08.4 

-26 

23 

1.9 

0.7 

F8Ia 



e Car 

Avior 

8 

22.5 

-59 

31 

1.9 

1.3 

K3III,B2V 

25 

sb 

t) UMa 

Alkaid 

13 

47.5 

49 

19 

1.9 

-0.2 

B3V 



9 Sco 

Girtab 

17 

37.3 

-43 

00 

1.9 

0.4 

F0II 

50 


p Aur 

Menkalinan 

5 

59.6 

44 

57 

1.9 

0.0 

A1IV 

30 


f Ori 

Alnitak 

5 

40.8 

-1 

57 

1.9 

-0.2 

O9.5Ib,B0III 

45 

b2.1 +4.2 

a TrA 

Atria 

16 

48.7 

-69 

02 

1.9 

1.4 

K2I1-III 

40 


y Gem 

Alhena 

6 

37.7 

16 

24 

1.9 

0.0 

A1IV 

30 


a Pav 

Peacock 

20 

25.7 

-56 

44 

1.9 

-0.2 

B3V 



S Vel 


8 

44.7 

-54 

43 

2.0 

0.0 

A1V 

20 


P CMa 

Mirzam 

6 

22.7 

-17 

57 

2.0 

-0.2 

B111—III 

70 

V 

a Hya 

Alphard 

9 

27.6 

-8 

40 

2.0 

1.4 

K31I-III 

60 


a Ari 

Hamal 

2 

07.2 

23 

28 

2.0 

1.2 

K2III 

25 
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Table C.19 Some double stars. Magnitudes of the components are m\ and m 2 , and the angular separation d: r is the 
distance of the star 


Name 


“2000 

M 

[min] 

$2000 

n 

n 

m\ 

m 2 

Spectrum 


d 

["] 

r 

[pc] 

r\ Cas 

Achird 

0 

49.1 

57 

49 

3.7 

7.5 

GOV 

M0 

12 

6 

Y Ari 

Mesarthim 

1 

53.5 

19 

18 

4.8 

4.9 

Alp 

B9V 

8 

40 

a Psc 

Alrescha 

2 

02.0 

2 

46 

4.3 

5.3 

AOp 

A3m 

2 

60 

y And 

Alamak 

2 

03.9 

42 

20 

2.4 

5.1 

K3IIb 

B8V.A0V 

10 

100 

.5 Ori 

Mintaka 

5 

32.0 

-0 

18 

2.5 

7.0 

B0III,O9V 

B2V 

52 

70 

X Ori 

Meissa 

5 

35.1 

9 

56 

3.7 

5.7 

08e 

B0.5V 

4 

140 

? Ori 

Alnitak 

5 

40.8 

-1 

56 

2.1 

4.2 

09.51be 

B0III 

2 

40 

a Gem 

Castor 

7 

34.6 

31 

53 

2.0 

3.0 

A1V 

A5Vm 

3 

15 

Y Leo 

Algieba 

10 

20.0 

19 

50 

2.6 

3.8 

Kim 

G7III 

4 

80 

| UMa 

Alula Australis 

11 

18.2 

31 

32 

4.4 

4.9 

GOV 

GOV 

1 

7 

a Cru 

Acrux 

12 

26.6 

-63 

06 

1.6 

2.1 

B0.5IV 

B1V 

4 

120 

Y Vir 

Porrima 

12 

41.7 

-1 

27 

3.7 

3.7 

F0V 

F0V 

3 

10 

a CVn 

Cor Caroli 

12 

56.1 

38 

18 

2.9 

5.5 

AOp 

F0V 

20 

40 

? UMa 

Mizar 

13 

23.9 

54 

56 

2.4 

4.1 

AlVp 

Aim 

14 

21 

a Cen 

Rigil Kentaurus 

14 

39.6 

-60 

50 

0.0 

1.3 

G2V 

K1V 

21 

1.3 

e Boo 

Izar 

14 

45.0 

27 

04 

2.7 

5.3 

K0II-III 

A2V 

3 

60 

.5 Ser 


15 

34.8 

10 

32 

4.2 

5.3 

F0IV 

F01V 

4 

50 

/3Sco 

Graffias 

16 

05.4 

-19 

48 

2.6 

4.9 

B1V 

B2V 

14 

110 

a Her 

Rasalgethi 

17 

14.6 

14 

23 

34L4.0 

5.7 

M5Ib-II 

G5IILF2V 

5 

120 

p Her 


17 

23.7 

37 

08 

4.5 

5.5 

B9.5III 

A0V 

4 


70 Oph 


18 

05.5 

2 

30 

4.3 

6.1 

K0V 

K5V 

2 

5 

e Lyr 


18 

44.3 

39 

40 

4.8 

4.4 

A4VF1V 

A8V,F0V 

208 

50 

e 1 Lyr 


18 

44.3 

39 

40 

5.1 

6.2 

A4V 

F1V 

3 

50 

e 2 Lyr 


18 

44.4 

39 

37 

5.1 

5.3 

A8V 

F0V 

2 

50 

K Lyr 


18 

44.8 

37 

36 

4.3 

5.7 

Am 

F01V 

44 

30 

9 Ser 

Alya 

18 

56.2 

4 

12 

4.5 

4.9 

A5V 

A5V 

22 

30 

Y Del 


20 

46.7 

16 

07 

4.5 

5.4 

K1IV 

F7V 

10 

40 

? Aqr 


22 

28.8 

-0 

01 

4.4 

4.6 

F3V 

F6IV 

2 

30 

S Cep 


22 

29.1 

58 

24 

3.5^k3 

7.5 

F5Ib-G21b 

B7IV 

41 

90 
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Table C.20 Milky Way Galaxy 


Property 

Value 

Mass 

[> 2 x 10 u M e 

Disc diameter 

30 kpc 

Disc thickness (stars) 

1 kpc 

Disc thickness (gas and dust) 

200 pc 

Halo diameter 

50 kpc 

Sun's distance from the centre 

8.5 kpc 

Sun's orbital velocity 

220 kms -1 

Sun's period 

240 x 10 6 a 

Direction of the centre (2000.0) 

a = 17 h 45.7 min 


S~= —29° 00' 

Direction of the north pole (2000.0) 

o' = 12h51.4 min 

S = +27°08' 

Galactic coordinates of the 

I = 123° 00' 

celestial north pole 

b = +27° 08' 
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Table C.21 Members of the Local Group of Galaxies. V = apparent visual magnitude. My = absolute visual magni¬ 
tude, r = distance 



a'2000 

<$2000 

Type 

V 

My 

r 


m 

[min] 

n 

['] 




[kpc] 

Milky Way 

17 

45.7 

-29 

00 

Sbc 


-20.9 

8 

NGC224 = M31 

00 

42.7 

41 

16 

Sb 

3.2 

-21.2 

760 

NGC598 = M33 

01 

33.8 

30 

30 

Sc 

5.6 

-18.9 

790 

Large Magellanic Cloud 

05 

19.6 

-69 

27 

Irr 

0.0 

-18.5 

50 

Small Magellanic Cloud 

00 

52.6 

-72 

48 

Irr 

1.8 

-17.1 

60 

NGC221 =M32 

00 

42.7 

40 

52 

E2 

7.9 

-16.5 

760 

NGC205 

00 

40.4 

41 

41 

dE5 

8.0 

-16.4 

760 

IC10 

00 

20.4 

59 

17 

Irr 

7.8 

-16.3 

660 

NGC6822 

19 

44.9 

-14 

48 

Irr 

7.5 

-16.0 

500 

NGC185 

00 

39.0 

48 

20 

dE3 

8.5 

-15.6 

660 

IC 1613 

01 

04.8 

02 

08 

Irr 

9.0 

-15.3 

720 

NGC 147 

00 

33.2 

48 

30 

dE4 

9.0 

-15.1 

660 

WLM 

00 

02.0 

-15 

28 

Irr 

10.4 

-14.4 

930 

Sagittarius 

18 

55.1 

-30 

29 

dE7 

3.1 

-13.8 

24 

Fornax 

02 

39.9 

-34 

30 

dE3 

7.6 

-13.1 

140 

Pegasus 

23 

28.6 

14 

45 

Irr 

12.1 

-12.3 

760 

Leo I 

10 

08.4 

12 

18 

dE3 

10.1 

-11.9 

250 

And II 

01 

16.5 

33 

26 

dE3 

12.4 

-11.8 

700 

And I 

00 

45.7 

38 

00 

dEO 

12.7 

-11.8 

810 

Leo A 

09 

59.4 

30 

45 

Irr 

12.7 

-11.5 

690 

Aquarius 

20 

46.9 

-12 

51 

Irr 

13.7 

-11.3 

1020 

SagDIG 

19 

30.0 

-17 

41 

Irr 

15.0 

-10.7 

1400 

Pegasus II = And VI 

23 

51.7 

24 

36 

dE 

14.0 

-10.6 

830 

Pisces = LGS 3 

01 

03.9 

21 

54 

Irr 

14.1 

-10.4 

810 

And III 

00 

35.3 

36 

30 

dE6 

14.2 

-10.2 

760 

And V 

01 

10.3 

47 

38 

dE 

14.3 

-10.2 

810 

Leo II 

11 

13.5 

22 

10 

dEO 

11.5 

-10.1 

210 

Cetus 

00 

26.1 

-11 

02 

dE 

14.3 

-10.1 

780 

Sculptor 

01 

00.1 

-33 

43 

dE3 

10.0 

-9.8 

90 

Phoenix 

01 

51.1 

-44 

27 

Irr 

13.2 

-9.8 

400 

Tucana 

22 

41.8 

-64 

25 

dE5 

15.1 

-9.6 

870 

Sextans 

10 

13.0 

-01 

37 

dE4 

10.3 

-9.5 

90 

Cassiopeia = And VII 

23 

26.5 

50 

42 

dE 

14.7 

-9.5 

690 

Carina 

06 

41.6 

-50 

58 

dE4 

10.6 

-9.4 

100 

Ursa Minor 

15 

08.8 

67 

07 

dE5 

10.0 

-8.9 

60 

Draco 

17 

20.3 

57 

55 

dE3 

10.9 

-8.6 

80 

Ursa Major 

158 

43.2 

51 

55 

dE 

13.2 

-6.8 

100 

Canes Venatici 

13 

28.0 

33 

33 

dE 

13.9 

-7.9 

220 

Bootes 

14 

00.0 

14 

30 

dE 

13.3 

-5.7 

60 

Ursa Major II 

08 

51.5 

63 

08 

dE 

14.3 

-3.8 

30 
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Table C.21 (Continued) 



“2000 

^2000 

Type 

V 

M v 

r 

[kpc] 

[h] 

[min] 

n 

n 

Coma Berenices 

12 

27.0 

23 

54 

dE 

14.5 

-3.7 

44 

Canes Venatici 11 

12 

57.2 

34 

19 

dE 

15.1 

-4.8 

150 

Hercules 

16 

31.0 

12 

47.5 

dE 

14.7 

-6.0 

140 

Leo IV 

11 

33.0 

-0 

32 

dE 

15.9 

-5.1 

160 

And IX 

00 

52.9 

43 

12 

dE 

16.2 

-8.3 

760 

And X 

01 

06.5 

44 

48 

dE 

16.1 

-8.1 

710 

And XI 

00 

46.3 

33 

48 

dE 

17.2 

-7.3 

760 

And XII 

00 

47.5 

34 

22 

dE 

18.1 

-6.4 

760 

And XIII 

00 

51.8 

33 

00 

dE 

17.6 

-6.9 

760 


Table C.22 Optically brightest galaxies. B = apparent blue magnitude, d = apparent diameter, r = distance 


Name 

“2000 

^2000 

Type 

B 

d 

r 


[h] 

[min] 

n 

n 



["] 

[Mpc] 

NGC55 

0 

15.1 

-39 

13 

Sc/Irr 

7.9 

30x5 

2.3 

NGC205 

0 

40.4 

41 

41 

E6 

8.9 

12 x6 

0.7 

NGC221 =M32 

0 

42.7 

40 

52 

E2 

9.1 

3.4 x 2.9 

0.7 

NGC224 = M31 

0 

42.8 

41 

16 

Sb 

4.3 

163x42 

0.7 

NGC247 

0 

47.2 

-20 

46 

S 

9.5 

21 x8 

2.3 

NGC253 

0 

47.6 

-25 

17 

Sc 

7.0 

22x5 

2.3 

Small Magellanic Cloud 

0 

52.6 

-72 

48 

Irr 

2.9 

216x216 

0.06 

NGC300 

0 

54.9 

-37 

41 

Sc 

8.7 

22 x 16 

2.3 

NGC598 = M33 

1 

33.9 

30 

39 

Sc 

6.2 

61 x42 

0.7 

Fornax 

2 

39.9 

-34 

32 

dE 

9.1 

50x35 

0.2 

Large Magellanic Cloud 

5 

23.6 

-69 

45 

Irr/Sc 

0.9 

432 x 432 

0.05 

NGC2403 

7 

36.9 

65 

36 

Sc 

OO 

OO 

22 x 12 

2.0 

NGC2903 

9 

32.1 

21 

30 

Sb 

9.5 

16x7 

5.8 

NGC3031 =M81 

9 

55.6 

69 

04 

Sb 

7.8 

25 x 12 

2.0 

NGC3034 = M82 

9 

55.9 

69 

41 

Sc 

9.2 

10 x 1.5 

2.0 

NGC 4258 = M 106 

12 

19.0 

47 

18 

Sb 

8.9 

19x7 

4.3 

NGC4472 = M49 

12 

29.8 

8 

00 

E4 

9.3 

10x7 

11 

NGC 4594 = M 104 

12 

40.0 

-11 

37 

Sb 

9.2 

8x5 

11 

NGC 4736 = M94 

12 

50.9 

41 

07 

Sb 

8.9 

13x12 

4.3 

NGC 4826 = M64 

12 

56.8 

21 

41 


9.3 

10x4 

3.7 

NGC 4945 

13 

05.4 

-49 

28 

Sb 

8.0 

20x4 

3.9 

NGC 5055 = M63 

13 

15.8 

42 

02 

Sb 

9.3 

8x3 

4.3 

NGC5128 = CenA 

13 

25.5 

-43 

01 

E0 

7.9 

23x20 

3.9 

NGC5194 = M51 

13 

29.9 

47 

12 

Sc 

8.9 

11 x6 

4.3 

NGC 5236 = M83 

13 

37.0 

-29 

52 

Sc 

7.0 

13 x 12 

2.4 

NGC 5457 = M101 

14 

03.2 

54 

21 

Sc 

8.2 

23x21 

4.3 

NGC 6822 

19 

45.0 

-14 

48 

Irr 

9.2 

20 x 10 

0.7 
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„ Abbreviation 

Latin name 

Genitive 

English name 

column gives the And 

Andromeda 

Andromedae 

Andromeda 

abbreviation of the Latin Ant 

Antlia 

Antliae 

Air Pump 

name used to form star 

Aps 

names r 

Apus 

Apodis 

Bird of Paradise 

Aql 

Aquila 

Aquilae 

Eagle 

Aqr 

Aquarius 

Aquarii 

Water-bearer 

Ara 

Ara 

Arae 

Altar 

Ari 

Aries 

Arietis 

Ram 

Aur 

Auriga 

Aurigae 

Charioteer 

Boo 

Bootes 

Bootis 

Herdsman 

Cae 

Caelum 

Caeli 

Chisel 

Cam 

Camelopardalis 

Camelopardalis 

Giraffe 

Cnc 

Cancer 

Cancri 

Crab 

CMa 

Canis Major 

Canis Majoris 

Great Dog 

CMi 

Canis Minor 

Canis Minoris 

Little Dog 

Cap 

Capricornus 

Capricomi 

Sea-goat 

Car 

Carina 

Carinae 

Keel 

Cas 

Cassiopeia 

Cassiopeiae 

Cassiopeia 

Cen 

Centaurus 

Centauri 

Centaurus 

Cep 

Cepheus 

Cephei 

Cepheus 

Cet 

Cetus 

Ceti 

Whale 

Cha 

Chamaeleon 

Chamaeleontis 

Chameleon 

Cir 

Circinus 

Circini 

Compasses 

Col 

Columba 

Columbae 

Dove 

Com 

Coma Berenices 

Comae Berenices 

Berenice’s Hair 

CrA 

Corona Austrina 

Coronae Austrinae 

Southern Crown 

CrB 

Corona Borealis 

Coronae Borealis 

Northern Crown 

Crv 

Corvus 

Corvi 

Crow 

Crt 

Crater 

Crateris 

Cup 

Cru 

Crux 

Cracis 

Southern Cross 

CVn 

Canes Venatici 

Canum Venaticorum 

Hunting Dogs 

Cyg 

Cygnus 

Cygni 

Swan 

Del 

Delphinus 

Delphini 

Dolphin 

Dor 

Dorado 

Doradus 

Swordfish 

Dra 

Draco 

Draconis 

Dragon 

Equ 

Equuleus 

Equulei 

Little Horse 

Eri 

Eridanus 

Eridani 

Eridanus 

For 

Fornax 

Fomacis 

Furnace 

Gem 

Gemini 

Geminoram 

Twins 

Gru 

Gras 

Gruis 

Crane 

Her 

Hercules 

Herculis 

Hercules 

Hor 

Horologium 

Horologii 

Clock 

Hya 

Hydra 

Hydrae 

Water Serpent 

Hyi 

Hydras 

Hydri 

Water Snake 

Ind 

Indus 

Indi 

Indian 
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Table C.23 (Continued) 

Abbreviation 

Latin name 

Genitive 

English name 

Lac 

Lacerta 

Lacertae 

Lizard 

Leo 

Leo 

Leonis 

Lion 

Lep 

Lepus 

Leporis 

Hare 

Lib 

Libra 

Librae 

Scales 

LMi 

Leo Minor 

Leonis Minoris 

Little Lion 

Lup 

Lupus 

Lupi 

Wolf 

Lyn 

Lynx 

Lyncis 

Lynx 

Lyr 

Lyra 

Lyrae 

Lyre 

Men 

Mensa 

Mensae 

Table Mountain 

Mic 

Microscopium 

Microscopii 

Microscpe 

Mon 

Monoceros 

Monocerotis 

Unicorn 

Mus 

Musca 

Muscae 

Fly 

Nor 

Norma 

Normae 

Square 

Per 

Perseus 

Persei 

Perseus 

Phe 

Phoenix 

Phoenicis 

Phoenix 

Pic 

Pictor 

Pictoris 

Painter 

PsA 

Piscis Austrinus 

Piscis Austrini 

Southern Fish 

Psc 

Pisces 

Piscium 

Fishes 

Pup 

Puppis 

Puppis 

Poop 

Pyx 

Pyxis 

Pyxidis 

Compass 

Ret 

Reticulum 

Reticuli 

Net 

Scl 

Sculptor 

Sculptoris 

Sculptor 

Sco 

Scorpius 

Scorpii 

Scorpion 

Set 

Scutum 

Scuti 

Sobieski's Shield 

Ser 

Serpens 

Serpentis 

Serpent 

Sex 

Sextans 

Sextan tis 

Sextant 

Sge 

Sagitta 

Sagittae 

Arrow 

Sgr 

Sagittarius 

Sagittarii 

Archer 

Tau 

Taurus 

Tauri 

Bull 

Tel 

Telescopium 

Telescopii 

Telescope 

TrA 

Triangulum Australe 

Trianguli Australis 

Southern Triangle 

Tri 

Triangulum 

Trianguli 

Triangle 

Tuc 

Tucana 

Tucanae 

Toucan 

UMa 

Ursa Major 

Ursae Majoris 

Great Bear 

UMi 

Ursa Minor 

Ursae Minoris 

Little Bear 

Vel 

Vela 

Velorum 

Sails 

Vir 

Virgo 

Virginis 

Virgin 

Vol 

Volans 

Volantis 

Flying Fish 

Vul 

Vulpecula 

Vulpeculae 

Fox 

Oct 

Octans 

Octantis 

Octant 

Oph 

Ophiuchus 

Ophiuchi 

Serpent-bearer 

Ori 

Orion 

Orionis 

Orion 

Pav 

Pavo 

Pavonis 

Peacock 

Peg 

Pegasus 

Pegasi 

Pegasus 
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Table C.24 Largest 
optical telescopes. 

D = diameter of the mirror 


Telescope 

Location 

Completion 

year 

D 

[m] 

GranTeCan 

La Palma 

2009 

10.4 

William M. Keck Telescope I 

Mauna Kea, Hawaii 

1992 

10 

William M. Keck Telescope II 

Mauna Kea, Hawaii 

1996 

10 

Southern African Large Telescope 

Sutherland, South Africa 

2005 

10 

Subaru Telescope 

Mauna Kea, Hawaii 

1999 

8.3 

Large Binocular Telescope 1 

Mt. Graham. Arizona 

2005 

8.4 

Kueyen Telescope (VLT 2) 

Cerro Paranal, Chile 

1999 

8.2 

Melipal Telescope (VLT 3) 

Cerro Paranal. Chile 

2000 

8.2 

Yepun Telescope (VLT 4) 

Cerro Paranal. Chile 

2000 

8.2 

Gemini North Telescope 

Mauna Kea, Hawaii 

1999 

8.1 

Gemini South Telescope 

Cerro Pachon. Chile 

2000 

8.1 

Multi-Mirror Telescope 

Mt. Hopkins, Arizona 

1999 

6.5 

Walter Baade (Magellan 1 Telescope) 

Las Campanas, Chile 

2000 

6.5 

Landon Clay (Magellan 2 Telescope) 

Las Campanas, Chile 

2002 

6.5 


Table C.25 Largest parabolic radio telescopes. D = diameter of the antenna, A. m i„ = shortest wavelength 




Completion 

year 

D 

[m] 

7. min 

[cm] 

Remarks 

Arecibo 

Puerto Rico, USA 

1963 

305 

5 

Fixed disk; limited tracking 

Green Bank 

West Virginia, USA 

2001 

100 x 110 

0.3 

The largest fully steerable telescope 

Effelsberg 

Bonn, Germany 

1973 

100 

0.8 


Jodrell Bank 

Macclesfield, Great Britain 

1957 

76.2 

10-20 

First large paraboloid antenna 

Jevpatoria 

Crimea 

1979 

70 

1.5 


Parkes 

Australia 

1961 

64 

2.5 

Innermost 17 m of dish can be used 
down to 3 mm wavelengths 

Goldstone 

California, USA 


64 

1.5 

Belongs to NASA deep space network 

Tidbinbilla 

Australia 


64 

1.3 

NASA 

Madrid 

Spain 


64 

1.3 

NASA 
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Table C.26 Millimetre and submillimetre telescopes and interferometers, h = altitude above sea level, D = diameter 
of the antenna, 7. m i„ = shortest wavelength 


Institute 

Location 

h 

[m] 

D 

[m] 

7 m i n 
[mm] 

Remarks; 
operational since 

NRAO, VLA 

New Mexico, USA 

2124 

25 

7 

27 antennas 
r/max = 36.6 km 
1976 

NRAO, VLBA 

[USA 

16-3720 

25 

13 

10 antennas 

1988-1993 

Max-Planck-lnstitut fur Radioastronomie & 
University of Arizona 

Mt. Graham, USA 

3250 

10 

0.3 

1994 

California Institute of Technology 

Mauna Kea, Hawaii 

4100 

10.4 

0.3 

1986 

Science Research Council England & Holland 

Mauna Kea, Hawaii 

4100 

15.0 

0.5 

The James 

Clerk Maxwell 
Telescope 

1986 

California Institute of Technology 

Owens Valley, USA 

1220 

10.4 

0.5 

3 antenna 
interferometer 

1980 

Sweden-ESO Southern Hemisphere Millimeter 
Antenna (SEST) 

La Silla, Chile 

2400 

15.0 

0.6 

1987 

Institut de Radioastronomie Millinretrique 
(IRAM), France & Germany 

Plateau de Bure, France 

2550 

15.0 

0.6 

3 antenna 

interferometer 

1990; 

fourth antenna 

1993 

IRAM 

Pico Veleta, Spain 

2850 

30.0 

0.9 

1984 

National Radio Astronomy Observatory 
(NRAO) 

Kitt Peak, USA 

1940 

12.0 

0.9 

1983 (1969) 

University of Massachusetts 

New Salem, USA 

300 

13.7 

1.9 

radom 

1978 

University of California, Berkeley 

Hat Creek Observatory 

1040 

6.1 

2 

3 antenna 
interferometer 

1968 

Purple Mountain Observatory 

Nanjing, China 

3000 

13.7 

2 

radom 

1987 

Daeduk Radio Astronomy Observatory 

Soul, South-Korea 

300 

13.7 

2 

radom 

1987 

University of Tokyo 

Nobeyama, Japan 

1350 

45.0 

2.6 

1982 

University of Tokyo 

Nobeyama, Japan 

1350 

10.0 

2.6 

5 antenna 
interferometer 

1984 

Chalmers University of Technology 

Onsala, Sweden 

10 

20.0 

2.6 

radom 

1976 
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Table C.27 Some important astronomical satellites and space probes 1980-2002 


Satellite 


Launch date 

Target 

Solar Max 

USA 

Feb. 14, 1980 

Sun 

Venera 13 

SU 

Oct. 30, 1981 

Venus 

Venera 14 

SU 

Nov. 4, 1981 

Venus 

IRAS 

USA 

Jan. 25, 1983 

infrared 

Astron 

SU 

Mar. 23, 1983 

ultraviolet 

Venera 15 

SU 

Jun. 2, 1983 

Venus 

Venera 16 

SU 

Jun. 7, 1983 

Venus 

Exosat 

ESA/USA 

May 26, 1983 

X-ray 

Vega 1 

SU 

Dec. 15 1984 

Venus/Halley 

Vega 2 

SU 

Dec. 21, 1984 

Venus/Halley 

Giotto 

ESA 

Jul. 2, 1985 

Halley 

Suisei 

Japan 

Aug. 18, 1985 

Halley 

Ginga 

Japan 

Feb. 5, 1987 

X-ray 

Magellan 

USA 

May 4, 1989 

Venus 

Hipparcos 

ESA 

Aug. 8, 1989 

astrometry 

COBE 

USA 

Nov. 18, 1989 

cosmic background radiation 

Galileo 

USA 

Oct. 18, 1989 

Jupiter etc. 

Granat 

SU 

Dec. 1, 1989 

gamma ray 

Hubble 

USA/ESA 

Apr. 24, 1990 

UV, visible 

Rosat 

Germany 

Jun. 1, 1990 

X-ray 

Gamma 

SU 

Jul. 11, 1990 

gamma ray 

Ulysses 

ESA 

Oct. 6, 1990 

Sun 

Compton 

USA 

Apr. 5, 1991 

gamma ray 

EUVE 

USA 

Jun. 7, 1992 

extreme UV 

Asuka 

Japan 

Feb. 20, 1993 

X-ray 

Clementine 

USA 

Jan. 25, 1994 

Moon 

ISO 

ESA 

Nov. 17, 1995 

infrared 

SOHO 

ESA 

Dec. 2, 1995 

Sun 

Near-Shoemaker 

USA 

Feb. 17, 1996 

Mathilde, Eros 

BeppoSAX 

Italy 

Apr. 30, 1996 

X-ray 

Mars Global Surveyor 

USA 

Nov. 7, 1996 

Mars 

Cassini/Huygens 

USA/ESA 

Oct. 15, 1997 

Saturn, Titan 

Mars Pathfinder/Sojourner 

USA 

Dec. 4, 1996 

Mars 

Lunar Prospector 

USA 

Jan. 6, 1998 

Moon 

Nozomi 

Japan 

Jul. 4, 1998 

Mars 

Deep Space 1 

USA 

Oct. 24, 1998 

Braille, Borrelly 

Stardust 

USA 

Feb. 7, 1999 

Wild 2 

Chandra 

USA 

Jul. 23, 1999 

X-ray 

XMM-Newton 

ESA 

Dec. 10, 1999 

X-ray 

Hete 2 

USA 

Oct. 9, 2000 

gamma ray 

Mars Odyssey 

USA 

Apr. 7, 2001 

Mars 

MAP 

USA 

Jun. 30, 2001 

cosmic background radiation 
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Table C.27 (Continued) 


Satellite 


Launch date 

Target 

Genesis 

USA 

Aug. 8, 2001 

solar particles 

RHESSI 

USA 

Feb. 5, 2002 

Sun 

Grace 

Gemany-USA 

Mar. 17, 2002 

Earth's gravity 

Integral 

ESA 

Oct. 17, 2002 

gamma ray 

Galex 

USA 

Apr. 28, 2003 

galaxies 

Hayabusa 

Japan 

May 9, 2003 

Itokawa 

Mars Express 

ESA 

Jun. 2, 2003 

Mars 

Spirit 

USA 

Jun. 10, 2003 

Mars 

Opportunity 

USA 

Jul. 8, 2003 

Mars 

Spitzer 

USA 

Jun. 10, 2003 

infrared 

Smart-1 

ESA 

Sep. 28, 2003 

Moon 

Rosetta 

ESA 

Mar. 2, 2004 

Churyumov-Gerasimenko 

Gravity Probe B 

USA 

Apr. 20, 2004 

relativity 

Messenger 

USA 

Aug. 3, 2004 

Mercury 

Swift 

USA 

Nov. 20, 2004 

gamma ray bursts 

Deep Impact 

USA 

Jan. 12, 2005 

Tempel 1 

Mars Recon Orbiter 

USA 

Aug. 12, 2005 

Mars 

Venus Express 

ESA 

Nov. 9, 2005 

Venus 

New Horizons 

USA 

Jan. 19, 2006 

Pluto 

Akari 

Japan 

Feb. 22, 2006 

infrared 



Answers to Exercises 


Chapter 2 

2.1 The distance is 7640 km, the northern¬ 
most point is 79°N, 45°W, in North Greenland, 
1250 km from the North Pole. 

2.2 The star can culminate south or north of 
zenith. In the former case we get 8 = 65°, </; = 
70°, and in the latter 8 = 70°, (p = 65°. 

2.3 (a) 4> > 58°7'. If refraction is taken into ac¬ 
count, the limit is 57°24'. (b)</> = «$ = 7°24'. 
(c) -59° 10' <(/><—0°50'. 

2.4 Pretty bad. 

2.5 Xq = 70°22', fi Q = 0°0\ A® = 250°22', 
/ 6 © = 0 ° 0 '. 

2.6 (c) ®o = 18 h. 

2.8 a = 6 h 45 min 9 s, 8 = -16°43'. 

2.9 u t = 16.7 kms -1 , v = 18.5 kms -1 , after ~ 
61,000 years, /x = 1.62" per year, parallax 0.42". 

Chapter 3 

3.1 (a) The flux density in the focal plane as well 
as the exposure time are proportional to ( D/f ) 2 . 
Thus the required exposure is 3.2 s. (b) 1.35 cm 
and 1.80 cm. (c) 60 and 80. 


3.2 (a) 0.001" (note that the aperture is a line 
rather than circular; therefore the coefficient 1.22 
should not be used), (b) 140 m. 

Chapter 4 

4.1 0.9. 

4.2 The absolute magnitude will be —17.5 and 
apparent 6.7. 

4.3 N(m + 1 )/N(m) = 10 3 / 5 = 3.98. 

4.4 r — 2.1 kpc, Eg-v = 0.7, and ( B — V)o = 
0.9. 

4.5 (a) Am — 1.06 mag, m = 2.42. (b) r = 
— In0.85 6 «« 0.98. 

Chapter 5 

5.2 n — 166, which corresponds to A = 21.04 cm. 
Such transitions would keep the population of the 
state n — 166 very high, resulting in downward 
transitions also from this state. Such transitions 
have not been detected. Hence, the line is pro¬ 
duced by some other process. 

5.3 If we express the intensity as something per 
unit wavelength, we get A max =1.1 mm. If the 
intensity is given per unit frequency, we have 
A max =1.9 mm. The total intensity is 2.6 x 
10 13 W m -2 sterad -1 . At 550 nm the intensity is 
practically zero. 
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5.4 (a) L — 1.35 x 10 29 W. The flux in the given 
interval can be found by integrating Planck’s 
law numerically. Using the Wien approximation 
a rather complicated expression can be derived. 
Both methods give the result that 3.3 % of the ra¬ 
diation is in the visual range, thus Ly — 4.45 x 
10 27 W. (b) At a distance of 10 pc the observed 
flux density is 3.7 x 10 -9 Wm -2 . (b) 10.3 km. 

5.5 M(,oi = 0.87, whence R = 2.0 Rq. 

5.6 T = 1380 K. There are several strong ab¬ 
sorption lines in this spectral region, reducing the 
brightness temperature. 

5.8 i> rms ~ 6700 km s _ 1 . 

Chapter 6 

6.1 u a /Vp = (1 — e)/(l + e). For the Earth this 
is 0.97. 

6.2 a = 1.4581 AU, 23.6 kms” 1 . 

6.3 The period must equal the sidereal rotation 
period of the Earth, r = 42,339 km = 6.64/?©. 
Areas within 8.6° from the poles cannot be seen 
from geostationary satellites. The hidden area is 

1.1 % of the total surface area. 

6.4 p = 3n/(GP 2 {a/2) i ) « 1400 kgra" 3 . 

6.5 M = 90°, E = 90.96°, / = 91.91°. 

6.6 The orbit is hyperbolic, a = 3.55 x 10 7 AU, 
e = 1 + 3.97 x 10“ 16 , r p — 2.1 km. The comet 
will hit the Sun. 

6.7 The orbital elements of the Earth calculated 
from Table C.12 are a = 1.0000, e = 0.0167, 
/ = 0.0004°, Q = -11.13°, zu = 102.9°, L = 
219.5°. The geocentric radius vector of the Sun 
in the ecliptic coordinates is 

/0.7583\ 
r = 0.6673 . 

V 0.0 ) 


The corresponding equatorial radius vector is 

/0.7583\ 
r = 0.6089 , 

\0.2640/ 

which gives a 2 h 35 min 3 s, 8 ~ 15.19°. 
The exact direction is a = 2 h 34 min 53 s, <5 = 
15.17°. 

Chapter 7 

7.1 Assuming the orbits are circular, the greatest 
elongation is arcsin(a/l AU). For Mercury this 
is 23° and for Venus 46°. The elongation of a su¬ 
perior planet can be anything up to 180°. The sky 
revolves about 15° per hour, and thus correspond¬ 
ing times for Mercury and Venus are 1 h 30 min 
and 3 h 5 min, respectively. In opposition Mars is 
visible the whole night. These values, however, 
depend on the actual declinations of the planets. 

7.2 (a) 8.7°. (b) The Earth must be 90° from 
the ascending node of Venus, which is the situ¬ 
ation about 13 days before vernal and autumnal 
equinoxes, around March 8 and September 10. 

7.3 (a) If the radii of the orbits are a \ and « 2 , the 
angular velocity of the retrograde motion is 

dk _ VGM 
df V fl i fl 2(VoT + JaP)' 

(b) In six days Pluto moves about 0.128° corre¬ 
sponding to 4 mm. For a main belt asteroid the 
displacement is almost 4 cm. 

7.4 If the orbital velocity of the planet is v the 
deviation in radians is 

v 1 lGM e 

a=- = -J -• 

c c V a 

This is greatest for Mercury, a = 0.00016 rad = 
33". This planetary aberration must be taken into 
account when computing accurate ephemerides. 
The deviation is largest when the planet is in con¬ 
junction or opposition and moves almost perpen¬ 
dicularly to the line of sight. 
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7.5 p = 0.11, q = 2, and A = 0.2. In reality the 
Moon reflects most of the light directly back¬ 
wards (opposition effect), and thus q and A are 
much smaller. 

7.6 Am = 0.9. The surface brightness remains 
constant. 

7.7 The absolute magnitude is V(1,0) = 23. 
(a) m = 18.7. (b) m — 14.2. At least a 15 cm tele¬ 
scope is needed to detect the asteroid even one 
day before the collision. 


Planet 

Nearest 

moon 

Radius 

of the 
planet 

Radius of 

moon’s 
orbit [km] 

Ratio 

Earth 

Moon 

6378 

384,400 

60 

Mars 

Phobos 

3396 

9377 

2.8 

Jupiter 

Metis 

71,492 

127,690 

1.8 

- 

rengas 

- 

122,500 

1.7 

Saturn 

Pan 

60,268 

133,584 

2.2 

- 

ring 

- 

92,000 

1.5 

Uranus 

Cordelia 

25,559 

49,751 

1.9 

- 

ring 

- 

44,718 

1.7 

Neptune 

Naiad 

24,764 

48,227 

1.9 

- 

ring 

- 

63,000 

2.5 


7.8 Using the values of Appendix C, the gravita¬ 
tional acceleration is 9.865 m/s 2 at the poles and 
9.799 m/s 2 on the equator. On the equator then 
rotational velocity is 464 m/s and the centrifugal 
acceleration 0.034 m/s 2 ; thus the total accelera¬ 
tion is on 9.765 m/s 2 . 

Chapter 8 

8.1 P sid = 11.9 a, a = 5.20 AU, <7 = 144,000 km. 
Obviously the planet is Jupiter. 

8.2 (a) Hint: If there is a synodic period P there 
must be integers p and q such that (ni —n\)P — 
2np and (/13 — n\)P = 2nq. Sometimes one can 
see claims that the configuration of the whole 
planetary system will recur after a certain period. 
Such claims are obviously nonsense, (b) 7.06 d. 

8.3 Apply (7.1), where now Pi = 24.62 and 
P 2 = 30.30. Hence P U2 = 131.34h = 5d 11 h 
20 min. One revolution as seen from the surface 
of Mars takes 131.34 hours. Thus Deimos seems 
to move 360/131.34 = 2.74 degrees an hour. 

8.4 According to Example 8.1 the distance of 
the Roche limit is R ^ 2.5 x 7 Z, where 1Z is the 
radius of the planet. The following table gives the 
distances of the nearest moon and the inner edge 
of the brightest ring of each planet. 


The nearest satellites and inner edges of rings 
are either close to the Roche limit or inside it. 
The Earth’s Moon is an exception in this re¬ 
spect. 

8.5 The densest hexagonal packing gives a vol¬ 
ume density of 

7t 2r 7T 

^ x —=- = —— « 0.74. 

3V3 VW^r 3V2 

The mass of hydrogen atoms that could be packed 
in one cubic metre is (1/5.5 x 10 ~ n ) 3 x 1.67 x 
10 - 27 x 0.74 = 7428 kg. The density of Jupiter 
calculated this way would be 7428 kg/m 3 . The 
actual density is only 1326 kg/m 3 . 

Chapter 9 

9.1 c g a d f e b; the actual spectral classes from 
top to bottom are A0, M5, 06, F2, K5, G2, B3. 

Chapter 10 

10.1 The period is P — 1 j -J2 years and the rela¬ 
tive velocity 42,100 ms -1 . The maximum sepa¬ 
ration of the lines is 0.061 nm. 

10.3 Substituting the values to the equation of 
Example 10.1 we get an equation for a. The so¬ 
lution is a = 4.4 AU. The mass of the planet is 
0.0015 M q . 
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Chapter 11 

11.1 10.5. 

11.2 (a) 9.5 x I O ’ 7 s _1 . (b) The neutrino pro¬ 
duction rate is 1.9 x 10 38 s _1 , and each second 
9 x 10 28 neutrinos hit the Earth. 

11.3 The mean free path is 1 / xp ~ 42,000 AU. 

Chapter 12 

12.1 fff = 6.7 x 10 5 a. Stars are born at the rate 
0.75 Mq per year. 

12.2 t t 400,000 a, t n 02 3 x 10 8 a. 

12.3 About 900 million years. 

Chapter 13 

13.1 (a) 6.3 x 10 7 Wm -2 . (b) 16 m 2 . 

13.2 807 Wm“ 2 . 

Chapter 14 

14.1 dr/r = —0.46, dM — 0.14. 

14.2 (a) T = 3570 K. (b) R mm /R m „ = 0.63. 

14.3 (a) 1300 pc. (b) 860 years ago; due to 
inaccuracies a safe estimate would be 860 ± 
100 years. Actually, the explosion was observed 
in 1054. (c) -7.4. 

Chapter 15 

15.1 L — 2.3 x 10 40 kgm 2 s -1 , di? = 45 m. 

15.2 (a) M = 0.5 Mq, a = 0.49 x 10 9 ms " 2 « 
4 x 10 7 g. (b) A standing astronaut will be subject 
to a stretching tidal force. The gravitational ac¬ 
celeration felt by the feet is 3479 ms -2 ss 355y 
larger than that felt by the head. If the astronaut 
lies tangentially, (s)he will experience a com¬ 
pressing force of 177 g. 


15.3 v = v e (l — GM/(Rc 2 )). If Av/v is small, 
we have also AA, = {GM / Rc 2 )X e . A photon 
emitted from the Sun is reddened by 2.1 x 
10 _ 6 A e . In yellow light (550 nm) the change is 
0.0012 nm. 

Chapter 16 

16.1 2.6 kpc and 0.9 kpc, a — 1.5 mag/kpc. 

16.2 7 kms -1 . 

16.3 The velocity of the proton is v — 0.0462c = 
1.38 x 10 7 ms -1 . The radius of the orbit is r = 
mv/qB — 0.01 AU. 

Chapter 17 

17.1 7.3. 

17.2 The potential energy is approximately U — 
—G(m 2 n 2 /(2R)), where m is the mass of one 
star, n the number of stars (there are n(n — 
l)/2 & n 2 /2 pairs), and R the radius of the clus¬ 
ter. The average velocity is oa Gmn/(2R ) = 
0.5 kms -1 . 

Chapter 18 

18.1 /x = 0.0055" a” 1 . 

18.2 (a) 5.7 kpc. (b) 11 kpc. Possible reasons 
for the discrepancy include: (1) The distance is 
so large that the approximations used for deriv¬ 
ing Oort’s formulae are not very good. (2) Taking 
into account the interstellar extinction will reduce 
the distance in (b). (3) The peculiar velocity of the 
star was neglected. 

18.3 (a) 3 (and the Sun), (b) The number is of the 
order of 100,000. This is a typical selection ef¬ 
fect: bright stars are rare but they are visible overt 
long distances. 

18.4 (a) If the thickness of the disk is //, the light 
has to travel a distance s = min{r, (H/2)secb} in 
the interstellar medium. Thus the magnitude will 
be m = M + 51g(r/10 pc) + as. (b) s — 200 pc, 
m = 10.2. 
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Chapter 19 

19.1 (a) 26, (b) 25. 

19.2 The diameter must be of the order one light- 
week « 1200 AU. M = -23.5. If this is the 
bolometric magnitude, the luminosity is L ss 2 x 
10 11 Lq, corresponding to 210 Lq AU -3 . 

Chapter 20 

20.1 r = v/H = 93 Mpc (if H — 75), diameter 
is 35 kpc, M — —20.4. Potential sources of error 
include: (1) inaccuracy of the Hubble constant, 
(2) peculiar velocity of the galaxy, (3) intergalac- 
tic extinction, (4) only 2-dimensional projection 
is observed, and the edge depends on the limiting 
magnitude used. 

20.2 If H = 50 kins -1 Mpc -1 , r = 51 Mpc, or 
74 times the distance of M31. If H is doubled, 
the distance is reduced to half of that, but is still 
higher than the value obtained in Exercise 19.1. 
A possible explanation is the peculiar velocity of 
the galaxy. 

20.3 m v = 1.5 x 10~ 35 kg, or 0.00002 times the 
mass of the electron. 

Chapter 21 

21.1 If the greenhouse effect is neglected, the 
distances corresponding to temperatures 373 K 
and 273 K are 0.47 AU and 0.87 AU, respec¬ 
tively. The effective temperature of the young Sun 
was T = f/0J x 5785 K = 5291 K. The corre¬ 


sponding limits then were 0.39 AU and 0.73 AU. 
Thus the continuously habitable zone would ex¬ 
tend from 0.47 to 0.73 AU. 

21.2 If there are n stars in a volume V, the dis¬ 
tance between the nearest neighbours is of the or¬ 
der y/V/n. If binary and triple stars in Table C. 17 
are counted as single objects, there are 47 stars 
in a volume of 520 cubic parsecs, (a) If a frac¬ 
tion 0.01 of them is habited, the average distance 
between neighbouring civilisations is 10 parsecs; 
(b) 100 parsecs. 

21.3 Since comets are mostly dirty ice, the den¬ 
sity is close to that of water. Thus the mass of the 
comet is about 5 x 10 8 kg. A freely falling ob¬ 
ject will reach a velocity that is the same as the 
escape velocity from the Earth, 11 km/s. The ki¬ 
netic energy of the comet is at least 3 x 10 16 J, 
corresponding to 7.6 megatons of TNT or 500 Hi¬ 
roshima bombs. 

Chapter 22 

22.1 The centre of gravity of the Sun and Jupiter 
is about 743,000 km from the centre of the Sun, 
i.e. somewhat outside the solar surface. From a 
distance of 10 pc this subtends an angle of 0.0005 
arcsecond. Thus the total amount of oscillation is 
one milliarcsecond. 

22.2 Jupiter covers a fraction 0.0106 of the so¬ 
lar disc, corresponding to 0.0115 magnitudes. For 
the Earth the change is 0.0001 magnitudes. 
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Chapter 2. Spherical Astronomy 

Plate 1 A long exposure shows the apparent 
motion of the celestial sphere. Pekka Parviainen 
has caught in his image the star trails, a trail of 
the crescent moon, and a trail of the rising Sun, 
taken through a 24 stop ND-filter. (Photo www. 
polarimage.fi) 

Plate 2 Atmospheric refraction has distorted 
and discoloured the image of the setting Sun. 
A rare green segment can be seen above the Sun. 
(Photo Pekka Parviainen) 

Chapter 3. Observations and Instruments 

Plate 3 Astronomical observatories are built in 
high, dry places with a clear sky. The picture 
shows the summit of Mauna Kea in Hawaii with 
the twin domes of the world’s largest telescopes, 
the William M. Keck Telescopes. Each of them 
has a mosaic mirror with a diameter of 10 meters. 
(Photo W.M. Keck Observatory) 

Plate 4 The next generation space telescope, 
James Webb, is under construction in the U.S. It 
will be launched about 2013. It will have a 6.5 
meter mirror made of 18 segments. The telescope 
will be situated at the L2 point of the Sun-Earth 
system, about 1.5 million kilometers from the 
Earth. (Illustration NASA) 


Chapter 7. The Solar System 

Plate 5 The transits of Mercury over the disk 
of the Sun were widely observed in 1999 and 

2003. The planet is nearing the limb of the Sun at 
the end of the latter transit. The photograph was 
taken with the 1 meter Swedish solar telescope 
on La Palma. (Photo Royal Swedish Academy of 
Sciences) 

Plate 6 Eclipses are astronomical phenomena 
that tend to attract the attention of the general 
public. A composite made from five different ex¬ 
posures shows the extent of the Earth’s shadow 
and the movement of the Moon from lower right 
to upper left. The eclipse occurred in October 

2004. (Photo Pekka Parviainen) 

Plate 7 The first and so far only colour pic¬ 
tures from the surface of Venus were obtained in 
1982. Panoramic views from Venera 13 {above) 
and Venera 14 {below). At the edges, the cam¬ 
era is looking at the horizon and in the centre, 
at the ground around the base of the space probe. 
Parts of the base, the protective covers of the cam¬ 
era and a colour map can be seen. (Photo Soviet 
Academy of Science) 

Plate 8 One of the largest impact craters on 
the surface of the Earth, Lake Manicouagan in 
northern Canada. The ancient impact structure 
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is about 70 kilometers in diameter. This natural- 
colour image of the crater was acquired by Nasa’s 
Terra satellite in June 2001. The impact is thought 
to have occurred about 212 million years ago. 
(Photo NASA/JPL/MISR) 

Plate 9 The last people to have walked on the 
Moon were geologist Harrison Schmitt (in the 
picture) and Eugene Cernan, who took the photo. 
Schmitt is studying a big boulder at the Taurus- 
Littrow landing site of Apollo 17 spacecraft in 
December 1972, with the lunar rover in the fore¬ 
ground. (Photo NASA) 

Plate 10 The “Burns Cliff” in the “Endurance 
Crater” on Mars. The photo was taken by the Op¬ 
portunity Mars rover in November 2004. Oppor¬ 
tunity and its twin Spirit have studied the surface 
of Mars in unprecedented detail since their land¬ 
ing in early 2002. The mosaic spans about 180 
degrees from side to side. (Photo NASA) 

Plate 11 Colour-enhanced picture of the surface 
of Europa. The surface is criss-crossed by linea¬ 
ments, and the colour differences show that they 
are younger than the surrounding areas. The sur¬ 
face is mainly water ice. (Photo Galileo/NASA) 

Plate 12 A true colour mosaic of Jupiter was 
taken by Cassini spacecraft in 2000, when Cassini 
made its closest approach to Jupiter on its way 
to Saturn. The clouds of the giant planet move 
in reddish-brown and white bands, with the Great 
Red Spot and several smaller white ovals in view. 
The smallest visible features are about 60 kilome¬ 
ters across. (Photo NASA) 

Plate 13 A simulated image of the rings of Sat¬ 
urn, based on radio occultation measurements 
by Cassini spacecraft in May 2005. Green and 
blue shades indicate regions where the particles 
are smaller than 5 centimeters and 1 centimeter 
across. In purple regions, the particles are mostly 
larger than 5 centimeters. The white area is the 
thickest part of B-ring, where radio signals were 
blocked. (Photo NASA) 

Plate 14 The first view from the surface of Ti¬ 
tan, the largest moon of Saturn, was obtained in 


January 2005, when the European Huygens probe 
made a soft landing. The boulders are made of 
water ice, and their size is mostly 10-20 centime¬ 
ters. The surface itself is methane-filled mud and 
sand. (Photo ESA/NASA) 

Plate 15 Details of the icy surface of Triton, the 
largest satellite of Neptune. The plain in the cen¬ 
ter is called Tuonela. The albedo of Triton is very 
similar to that of Pluto, and thus this kind of fea¬ 
ture may be seen by the New Horizons spacecraft 
in July 2015 when it flies by Pluto. (Photo Voy¬ 
ager 2/NASA) 

Plate 16 Surface and jets of Comet Wild 2. The 
Stardust spacecraft flew by the head of comet 
Wild 2 in January 2004. The best black-and-white 
photo of the comet’s surface has been merged 
with a long-exposure photo showing the faint dust 
jets emanating from the surface. The diameter of 
the comet’s head is about 5 km. (Photo NASA) 

Chapter 9. Stellar Spectra 

Plate 17 Spectra of stars of different spectral 
classes. The wavelength range is in the visual 
part, from 400 to 700 nanometers. The O stars at 
the top are hottest, and the M stars at the bottom, 
the coolest. (Photo NOAO/AURA/NSF) 

Chapter 13. The Sun 

Plate 18 Three-colour composite of the solar 
surface, photographed by the Extreme Ultravio¬ 
let Imaging Telescope of SOHO satellite in May 
1998. Three ultraviolet images taken at differ¬ 
ent wavelengths (17.1 nm, 19.5 nm and 28.4 nm) 
were coded red, yellow and blue and merged into 
one picture. (Photo SOHO/EIT/NASA/Goddard 
Space Flight Center) 

Chapter 16. The Interstellar Medium 

Plate 19 One of the most detailed visual-wave- 
length pictures of the Orion Nebula was pub¬ 
lished in 2006. It was taken by the Hubble Space 
Telescope, which used 105 orbits to photograph 
the nebula. The original picture has 1000 million 
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pixels. Thousands of previously unknown stars 
were found in the picture, and also many new 
brown dwarfs. Orion Nebula is a large dust and 
gas region, giving rise to new stars. The width of 
the picture is about half a degree (the same as full 
Moon). (Photo NASA/ESA/M. Robberto) 

Plate 20 Orion Nebula photographed by the 
Spitzer Telescope in infrared light in 2006. While 
the Hubble picture (Plate 19) shows bright gas, 
with dark dust lanes, Spitzer sees the dust as 
bright regions. The wavelength of 8 microme¬ 
ters has been coded red , and 5.8 microns, orange. 
They represent dust heated by the stars. Light 
of 4.5 microns (green) shows hot gas and dust, 
and light of 3.6 microns (blue) is from starlight. 
(Photo NASA) 

Plate 21 The dark cloud Barnard 68 in the con¬ 
stellation of Ophiuchus. B68 is a dense molecu¬ 
lar cloud, which is just starting to contract and 
later form new stars. The diameter of the cloud 
is 7 light-months (0.2 parsecs), and its distance is 
about 500 light-years. It was photographed by the 
Antu (VLT 1) Telescope in 1999. (Photo ESO) 

Plate 22 The star V838 Monocerotis brightened 
suddenly in early 2002. The outburst lasted sev¬ 
eral weeks and sent a sphere of light spread¬ 
ing into the surrounding interstellar medium. The 
light-echo has illuminated the dust clouds around 
the star and unveiled never-before-seen patterns. 
This photo was taken by the Hubble Space Tele¬ 
scope at the end of 2004, nearly three years after 
the outburst. (Photo NASA/ESA) 

Plate 23 The Rosette Nebula in Monoceros be¬ 
longs to the same extensive dust and gas complex 
as the Orion Nebula. Within the Rosette Nebula 
there is a young star cluster called NGC 2244, 
which gives energy to the cloud. The ultraviolet 
light from the stars makes the hydrogen glow red , 
and the oxygen, greenish. The picture is about 1 
degree wide, and it was taken by the MegaPrime 
Camera at the Canada-France-Hawaii Telescope 
on Mauna Kea in 2003. (Photo CFHT/J.-C. Cuil- 
landre) 


Plate 24 The Helix Nebula is one of the nearest 
planetary nebulae and also one of the apparently 
widest. Its diameter is nearly one degree (two full 
Moons side by side), but its surface brightness is 
so low it cannot be seen by the naked eye. It is lo¬ 
cated in the constellation of Aquarius. This com¬ 
posite photograph combines an ultra-sharp mo¬ 
saic by the Hubble Space Telescope with a wide 
view by the WIYN 0.9 meter Telescope on Kitt 
Peak. (Photo NASA/NOAO/ESA) 

Plate 25 Radio image of the Cassiopeia A su¬ 
pernova remnant. Cas A is one of the bright¬ 
est radio sources in the sky. This image was 
made at three different frequencies: 1.4 GHz, 
5.0 GHz and 8.4 GHz, with the Very Large Ar¬ 
ray radio telescope in New Mexico. The radiation 
comes mainly from relativistic electrons. (Image 
NRAO/AUI) 

Plate 26 The remnant of the Tycho supernova 
in X-rays. The supernova exploded in 1572 and 
was observed by Tycho Brahe. During the last 
years, the Chandra X-Ray Observatory has made 
observations of this object. The image shows an 
expanding bubble of extremely hot gas (in green 
and red) inside a shell of rapidly moving elec¬ 
trons (blue). (Image NASA/Chandra) 

Chapter 19. Galaxies 

Plate 27 Part of the Andromeda Galaxy M31, 
photographed with the Japanese Subaru 8.4 m 
telescope on Mauna Kea, Hawaii. The nucleus of 
the galaxy is outside the picture on the upper left. 
The stars with the vertical lines are foreground 
stars in our Milky Way. M31 is a sister galaxy 
to our Milky Way, with similar star clusters, dust 
and gas clouds and spiral arms. (Photo Subaru 
Telescope, NOAJ) 

Plate 28 The most detailed infrared view of 
the Andromeda Galaxy so far was obtained by 
the Spitzer Telescope in 2004. While the Sub¬ 
aru picture (Plate 27) shows the starlight from 
the galaxy, the Spitzer picture records the infrared 
radiation from the dust in the spiral arms at the 
wavelength of 24 micrometers. The spiral arms 
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with their warm dust and star-forming clouds can 
be seen much more clearly than at visual wave¬ 
lengths. (Photo Spitzer/NASA) 

Plate 29 The central part of the active galaxy 
Centaurus A, NGC 5128, seen by the Hubble 
Space Telescope. The whole galaxy is crossed by 
a thick dust lane with intricate wisps of warm 
dust and gas. At the centre of the galaxy, behind 
the dust shroud, lies a black hole, which sends 
powerful jets in two directions. The jets can be 
seen only in X-rays or at radio wavelengths. For 
a wider view of Centaurus A see the next Plate. 
(Photo NASA) 

Plate 30 New pictures of the outer parts of 
Centaurus A have revealed several ring-like fea¬ 
tures, which contain faint blue stars. The rings 
may have been formed in interactions with other, 
smaller galaxies, which have collided with Cen¬ 
taurus A and lost part of their stars to this giant 
galaxy. (Photo Cerro Tololo International Obser¬ 
vatory, NOAO/NSF) 

Plate 31 A well-known galaxy-quasar pair: in 
the center the galaxy NGC 4319, and on the up¬ 
per right, the quasar Markarian 215. Although the 
pair seems to belong together, in reality this is 
a case of chance alignment. The distance to the 
quasar is 14 times larger than the distance to the 
galaxy. (Photo Hubble Space Telescope/NASA) 


there seem to be six galaxies engaging in a cos¬ 
mic dance. The number of interacting galaxies is 
however only four. The small well-defined spiral 
galaxy to the right of the center, lies nearly five 
times further away than the others. And the right¬ 
most “galaxy” isn’t actually a galaxy at all, but 
a long “tidal tail” of stars torn off from the other 
galaxies. (Photo Hubble Space Telescope/NASA) 

Chapter 20. Cosmology 

Plate 33 The Hubble Space Telescope has pho¬ 
tographed several deep sky fields to get represen¬ 
tative samples of the Universe. This photograph is 
the Hubble Deep Field South in the constellation 
Tucana, and it contains thousands of never-before 
seen galaxies up to the distance of 12 Ga. The 
small red dot in the middle of the photograph is 
a distant quasar, whose light was studied with the 
Space Telescope Imaging Spectrograph, to map 
invisible clouds of hydrogen gas between us and 
the quasar. (Photo NASA) 

Plate 34 Temperature fluctuations in the cosmic 
microwave background. The observations were 
made with the Wilkinson Microwave Anisotropy 
Probe (WMAP) in 2002. From the scale depen¬ 
dence of the fluctuations we can estimate the av¬ 
erage density of the Universe (very near the crit¬ 
ical density), and the age of the Universe (about 
13.7 Ga). (Photo WMAP/NASA) 


Plate 32 A small group of interacting galax¬ 
ies called Seyfert’s Sextet. As the name implies, 
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